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RETURN  POLICIES  FOR  AN  INVENTORY  SYSTEM 
WITH  POSITIVE  AND  NEGATIVE  DEMANDS 

Daniel  P.  Heyman 

Bell  Telephone  Laboralories 
Holmdel,  New  Jersey 

ABSTRACT 

We  consider  a single  ilem  inventory  system  with  positive  and  negative  stock 
fluctuations.  Items  can  be  purchased  from  a central  slock,  n items  can  be  re- 
turned for  a cost  R -I-  rn,  and  a linear  inventory  carrying  cost  is  charged.  It  is 
shown  that  for  minimizing  the  asymptotic  cost  rale  when  returns  are  a 
signiflcani  fraction  of  slock  usage,  a Iwo-critical-number  policy  (a,b)  is  optimal, 
where  b is  the  trigger  level  for  returns  and  b - a is  the  return  quantity.  The 
values  for^  and  b are  found,  as  well  as  the  operating  characteristics  of  the  sys- 
tem. We  also  consider  the  optimal  return  decision  to  make  at  lime  zero  and 
show  that  it  is  partially  determined  by  a and  b. 

1.  INTRODUCTION 

For  some  types  of  telephone  central  office  equipment  it  has  been  observed  that  disconnec- 
tions, rearrangements,  and  so  on,  of  the  equipment  occur  frequently  enough  that  net  demand 
(outflow  minus  inflow)  may  be  negative  in  some  time  periods.  This  phenomenon  may  also 
arise  whenever  a supplier  receives  new  and  repaired  items  to  satisfy  customers,  for  example 
military  depots.  For  these  situations,  inventory  models  that  consider  only  positive  demands  are 
inadequate. 

The  negative  demands  cause  the  inventory  level  to  increase,  perhaps  to  undesirably  large 
values.  Thus,  we  are  interested  in  finding  a policy  for  returning  inventory  to  another  storage 
facility  — which  is  the  subject  of  this  paper.  We  will  consider  the  dynamic  behavior  of  a single 
inventory  location  which  has  the  option,  at  any  time,  of  returning  any  part  of  its  stock  to  a cen- 
tral warehouse. 

The  single  inventory  location  model  is  of  interest  in  its  own  right,  but  it  is  also  important 
as  a building  block  for  multiechelon,  multilocation  models.  In  the  telephone  central-office 
application,  many  central  offices  are  supplied  from  a common  central  stock,  which  can  place 
orders  on  an  external  source.  Clearly,  the  return  policies  followed  at  each  central  office  should 
be  jointly  determined  for  optimal  performance. 

In  an  unpublished  report  (31,  the  author  studied  such  a two-echelon,  multilocation  model 
and  found  that  the  jointly  optimized  policies  for  the  second-echelon  facilities  were  approxi- 
mately the  same  as  the  individually  optimized  policies  obtained  from  a single-location  model, 
and  so  were  the  operating  characteristics  of  the  system,  for  reasonable  values  of  the  system’s 
parameters.  Furthermore,  obtaining  the  jointly  optimized  policies  is  computationally  prohibitive 
(except  for  exploratory  research),  so  individual  optimization  provides  the  only  practical  method 

581 


79  04  la  044 


I 


t 

I 


I « 


i 


f 

i 

f 


582  D.  P.  HEYMAN  J 

f 

I 

for  obtaining  return  policies.  The  two-echelon  model  mentioned  above  requires  a large  amount 
of  detailed,  but  standard,  manipulation  to  analyze;  some  numerical  results  obtained  from  it  will 
be  reported  in  this  paper. 

The  traditional  inventory  models  that  assume  that  demands  are  nonnegative  and  do  not 
permit  stock  returns  are  clearly  not  suitable  for  the  problem  considered  here.  The  cash-balance 
inventory  models  (see,  e.g.,  [2]  and  [10])  explicitly  consider  negative  demands  but  also  allow 

the  stock  level  after  a return  to  be  negative,  which  is  not  appropriate  for  our  problem.  If  the  , 

cash-balance  models  are  modified  so  the  stock  level  is  always  nonnegative,  one  can  obtain  , 

optimal  return  policies  for  a single  review-period  model  quite  readily,  but  multiperiod  problems  ^ 
lead  to  intractable  dynamic  programs  111].  Whisler  [14]  studied  a stochastic  inventory  model 
for  rented  equipment.  The  decision  to  reduce  the  number  of  items  rented  in  Whisler’s  model 
corresponds  to  the  decision  to  reduce  inventory  in  our  model,  but  Whisler  assumes  that  no  ord-  1 

ers  can  occur  when  the  stock  level  is  zero  except  between  regularly  spaced  decision  epochs,  * i 

which  is  not  appropriate  for  our  problem.  Our  formulation  allows  a fixed  charge  for  returns  to  , 

be  incorporated  in  the  model  with  no  extra  work;  fixed  charges  are  very  troublesome  to  add  to  i 

Whisler’s  model.  Heyman  l4]  considered  the  problem  of  setting  a maximum  stock  level  when  i 

there  are  negative  demands.  That  model  can  be  interpreted  as  a special  case  of  this  model  ! 

where  the  return  quantity  is  fixed  at  unity.  Hoadley  and  Heyman  (5I  describe  a one-period  i 

model  for  the  problem  discussed  here.  It  has  not  been  found  possible  to  extend  that  model  to 

a dynamic  situation.  ! 

In  section  2 we  formulate  our  model  and  obtain  some  preliminary  results  for  obtaining  the  | 

optimal  long-run  policy.  Results  for  completely  specifying  the  model  as  a nonlinear  program  I 

are  given  in  section  3.  This  nonlinear  program  can  be  solved,  in  integers,  using  an  unpublished 
fixed  point  formulation  due  to  R.  Saigal  112).  Some  numerical  examples  are  given  in  section  4.  [ 

In  section  5 we  formulate  the  problem  as  a Markov-renewal  program  to  study  the  return  prob-  ' 

lem  at  time  zero.  Consequences  of  this  formulation  for  the  infinite  horizon  problems  are  given  j 

in  section  6. 

i;- 

2.  MODEL  FORMULATION 

In  this  section  we  will  describe  the  basic  structure  of  our  model  and  obtain  some  prelim-  [ 

inary  results. 

2.1  Basic  Assumptions 

To  avoid  confusion  between  items  which  are  returned  from  the  field  and  items  which  are 
sent  to  the  central  stock,  we  will  call  the  former  disconnects  and  the  latter  returns.  The  demands 
on  the  facility  will  be  called  connects,  and  items  sent  from  the  central  stock  to  the  facility  will  be 
called  shipments. 

We  will  make  the  following  assumptions  about  costs; 

(i)  The  cost  of  returning  n items  is  /?  + nr. 

(ii)  The  cost  of  shipping  n items  is  ns. 

(iii)  There  is  a holding  cost  rate  of  h per  item  per  unit  time  applied  to  each  item  in  inven- 
tory. 


A fixed  cost  is  absent  in  (ii),  because  a warehouse  activity  should  be  preparing  items  for  ship- 
ment as  a matter  of  course.  The  holding-cost  rate  reflects  the  usual  collection  of  warehousing 
expense,  ad  valorem  taxes,  tied  up  capital,  and  so  forth. 
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We  will  assume  that  the  inventory  facility  always  receives  a shipment  immediately  after 
requesting  it  from  central  stock,  and  that  the  connects  and  disconnects  each  form  a Poisson 
process  with  rates  /u  and  K respectively.  To  avoid  trivialities  we  assume  0 < X.^x  < oo.  These 
Poisson  assumptions  are  necessary  for  the  analysis  given  in  section  3;  useful  approximate 
results  can  be  obtained,  if  the  processes  are  allowed  to  be  arbitrary  renewal  processes,  by  the 
methods  described  in  [4], 

Let  us  temporarily  assume  a two-critical-number  policy  denoted  (.a.b),  with  a and  b 
integers  and  a < b.  The  number  b is  the  "trigger  level"  for  making  a return  and  a is  the  stock 
level  after  the  return  of  size  — a is  made.  In  section  6 we  will  show  that  such  a policy  is 
indeed  optimal.  For  the  present  we  will  concentrate  on  finding  the  optimal  values  of  a and  b, 
denoted  a*and  ft* respectively,  which  minimize  the  asymptotic  cost  rate  of  the  system,  denoted 
by  C(a,b).  Note  that,  by  cost  assumptions  (ii)  and  (iii)  and  the  assumption  of  zero  lead  time 
for  shipments,  only  one  item  should  be  ordered  at  a time  from  central  stock  and  then  only 
when  a connect  occurs  when  the  inventory  is  zero.  We  will  let  C*  — C(a*,ft*). 

2.2  Obtaining  the  Cost  Function  and 
Some  Conservation  Laws 


Let  us  examine  the  inventory  process.  A sample  path  of  the  stock  level  will  have  the 
general  behavior  shown  in  Figure  1.  The  arrows  at  I4  and  signify  that  a disconnect  occurred 
when  the  stock  level  was  ft  - 1,  so  the  stock  level  rose  to  ft  and  then  instantly  was  reduced  to  a 
by  a return.  Note  that  the  stock  level  is  at  ft  for  only  an  instant,  so  ft  — 1 is  the  largest  stock 
level  actually  observed. 


During  the  time  intervals  (to-til  and  ihe  stock  level  is  zero,  so  connects  occurring 

during  these  time  intervals  will  have  to  be  satisfied  by  shipments. 


STOCK 

LEVEL 


TIME 


FIGURE  1.  A typical  realization  of  stock  levels. 
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Let  ttq  “ Tfoia.b)  and  W(,_|  — 7rj,_i  (a,b)  be  the  asymptotic  probabilities  that  the  stock 
level  is  zero  and  <»  - 1 respectively.  Since  the  connects  form  a Poisson  process,  asymptotically, 
the  rate  at  which  connects  occur  when  the  stock  level  is  zero  is  Since  each  such  connect 
generates  a shipment  which  costs  s,  the  cost  rate  for  shipments  is  s/ztto.  Since  the  disconnects 
form  a Poisson  process,  the  rate  at  which  returns  are  made  is  Xrrf,_|.  Since  each  return  costs 
R + (b-a)r,  the  cost  rate  for  returns  is  [/?  + (6-fl)rl\iri,_|.  Let  L - Lia.b)  be  the  average 
stock  level  in  the  steady  state.  Then  bL  is  the  asymptotic  holding-cost  rate.  Thus,  the  total 
cost  rate,  C(a,b),  is 


(2.1)  C(a,b)  — s^two  + [/?  + (6— a)rJX»rj,_i  + hL. 

We  can  simplify  (2.1)  by  establishing  an  equation  relating  wq  and  Tr,,_|.  The  disconnects  ’ 

occur  at  rate  X and  are  used  either  to  satisfy  future  connect  orders  directly  or  are  returned.  The 
rate  at  which  the  former  use  occurs  is  because  that  is  the  rate  at  which  disconnects 

are  satisfied  from  stock  at  the  second  echelon.  The  rate  at  which  the  latter  use  occurs  is 
(/)— fl)Xir(,_|,  because  Xw^^i  is  the  rate  at  which  returns  are  made,  and  each  return  is  of  size 
b-a.  By  conservation  we  obtain  i 


(2.2)  X - (l-wo)^  + (.b-a)(knti.x), 

which  relates  no  to  n^.i.  Thus,  C(a.b)  can  be  specified  once  iro  and  L are  determined.  Sub- 
stituting (2.2)  into  (2.1)  yields 

(2.3)  C(a.b)  - SMW0+  [R  + (b-a)r]lK- (I  - n^fi](b-a)-^  + HL. 

Now  let  us  consider  the  net  demand  rate  (shipment  rate  minus  return  rate)  a second 
echelon  facility  places  on  the  central  stock  By  conservation  it  must  be  /u.  - X;  if  this  quantity  is 
negative,  then  the  facility  returns  more  than  it  demands  in  shipments,  on  the  average. 

- 3.  THE  EVALUATION  OF  wq  AND  L 

In  this  section  we  will  solve  for  ttq  and  L\  substituting  these  results  in  (2.3)  we  obtain 
Cia.b). 

For  a particular  policy  (a,/>),  let  n,  — n,(a,b)  denote  the  steady  state  probability  that  the 
stock  level  is  /,  / - 0, 1,  ...  ,b-l.  Let  p — X/^;  p is  the  average  number  of  disconnects  that 
occur  between  each  successive  pair  of  connects,  and  (1  -t-  p)"'  is  the  probability  that  the  next 
event  (an  event  is  a connect  or  a disconnect)  is  a connect.  From  standard  Markov  process  argu- 
ments (see,  e.g.,  [9,  Chapter  21)  we  know  that  the  rr/s  exist  and  are  the  unique  probabilities 
which  satisfy 

(3.1) 

(3.2) 

(3.3) 

(3.4) 


f 

and 

h-l 

1 

f 

(3.5) 

M 

JS 

1 

/-O 

( 

When  0 “ 6 

- 1 we  ignore  (3.3)  and  use  (3.4)  with  the  convention  wt,  * 0. 

TTj  — (1  +p)  ‘rr^+i  + p(l  +p)“  / — 1, 2,  ...»  b—2,  i ^ a, 

TTo  “ P"'wi, 

iTft.i  - p(l-f-p)"'w(,_2,if  a < b-2. 

Wa  - (l-i-p)"'7r.  + i + p(l-i-p)“'(Wo_,  + Wft.i), 
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To  solve  these  equations  we  first  consider  (3.1)  as  a difference  equation  for  / — 1,2,  , 

a—\  with  an  initial-value  condition  given  by  (3.2).  Solving  in  the  usual  way,  we  get 

(3.6)  iri’-p'itQ,  /— 0, 1,  ....  a. 

Now  we  consider  (3.1)  as  a difference  equation  for  / — a-¥\,  a+2,  ....  b-2  with  (3.3) 
as  a boundary  condition.  The  general  solution  to  (3.1)  is 

(3.7)  ir,  — C]  + c^', 

where  C|  and  C2  are  arbitrary  constants.  In  particular, 

(3.8)  — C|  -I-  C2p'’"* 


(3.9)  ir(,_2  ” Cl  + C2p'’  ^ 
Substituting  (3.9)  into  (3.3),  we  obtain 

(3.10)  Cj  -I-  C2P*“^  "• 

P 

Solving  (3.8)  and  (3.10)  simultaneously  yields 


P-1  p*-'(l-p)’ 


substituting  (3.11)  into  (3.7)  we  get 


(3.12)  IT,  - ^ rr*_|,  / - a -1-1,  a -1-2,  . . . , b-\, 

p-  1 

when  p 1.  For  p — 1,  we  apply  L’Hospital’s  rule  to  (3.12)  and  obtain 

(3.13)  IT,  — «•(,_!,  i — a-l-1,  a-l-2,  . . . , 1. 


Substituting  (3.6)  and  (3.12)  in  (3.4)  we  get 

p^d-p"-')  . p" 

(3.14)  ^*-1  + ^0- 

From  (3.6)  we  also  have 

(3.15)  - pVo. 

From  (3.14)  and  (3.15)  we  obtain 

(3.16)  TTo ^h-\.  P 1. 

p"  '(p-1) 

For  p “ 1, 

(3.17)  TTo  “ (f>-a)ir»_,. 


Together,  (3.6),  (3.12),  and  (3.16)  express  ir,  in  terms  of  iTt,-\  for  / — 0,1,  ...  , b-2. 
With  these  and  (3.5)  we  obtain 

(3.18)  (p  1), 

1 - (A-a)p*(l  -p)  - p*  “ 


(3.18) 
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(3.19) 


‘jr-a)(]+a+b)‘ 


The  steady-state  mean  of  the  process 

*-i 

L = L(a,b,p)  =“  /jT, 


appropriate  equations  for  tt,,  for  example,  (3.5),  (3.11),  (3.15)  and 

ricncc 

<3-20) 

0 a + \ P-1 


i 7r*_|  ff'-J  *-l 

!'>'+  I/p' 

0 p ‘ [a  + l a + | 


for  p 1 . 

The  summations  in  (3.20)  are  standard  geometric  ones,  and  closed  form  expressions  can 
be  obtained  with  a reasonable  amount  of  effort.  Performing  the  summations  yields 


(3.21a) 


^ ='^077^  ll-p‘'ll+a(l-p)])  -I-  ^ 
(l-p)  p-1 


(b+a)(b-a~\) ^ 

[2  (l-p)2 

lp“la(l-p)  -I-  11  -pX-'lp  *(l-p)l)j 


for  a < h - 1 and 
(3.2lb)  I 


l-p*  l-p 


^ (1-P*-'11  + (A-l)(l-p))) 


“ r - I- 

for  a — h - 1.  Equation  (3.21b)  is  the  steady-state  expected  number  of  customers  in  an 
M/M/l  queue  when  the  queue  is  limited  to  * - 1 customers.  By  collecting  terms,  both  (3.21a) 
and  (3.21b)  can  be  written  as 


(3.22) 


^ “ 2(f-p)^  2iro-ir*_,((l-p)(h2_fl2)  + (n.p)(^_a)] 


Corresponding  expressions  for  p - 1 can  be  obtained  in  a similar  manner. 

From  these  results  for  ng  and  L we  observe  that  C{a,b)  is  a nonlinear  function  of  a and 
b.  Although  TTo  and  L depend  on  X and  p only  through  their  ratio  p , Cia.b)  depends  on  the 
magnitudes  of  X and  fi. 

The  cost  function  can  be  minimized,  subject  to  a < ft,  by  a method  due  to  Saigal  [12]. 
This  method  uses  a piecewise-linear  approximation  to  the  cost  function  and  always  produces  an 
integer  answer.  If  the  approximation  is  convex,  the  answer  is  the  global  optimum;  otherwise,  it 
may  not  be  (even  if  the  cost  function  itself  is  convex  ).  In  section  6 we  give  a test  for  global 
optimality  which  can  be  used  to  check  if  the  algorithm  has  produced  the  optimal  solution 


I 
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4.  NUMERICAL  EXAMPLES 

Take  R = $\0,  r = S2/item,  s = S5/item  and  A = 52  per  item  per  month  for  an  item  I 

which  costs  5100  to  purchase.  Different  pairs  of  monthly  connect  and  disconnect  rates  are  | 

used.  The  results  of  some  sample  problems  are  shown  in  Tables  1 to  3.  In  these  tables,  the  | j 

entries  marked  "joint"  refer  to  optimizing  the  three  facilities  in  the  example  Jointly,  including 
their  effect  on  the  central  stock.  The  optimization  was  done  by  using  a nonlinear  programming 
algorithm  of  Saigal  l\2j. 

TABLE  1.  Sample  problem  I 


TABLE  2.  Sample  Problem  II 
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TABLE  3.  Sample  Problem  III 


Facility 

Discon. 

Rate 

Con. 

Rate 

a 

b 

Prob. 

Empty 

Prob. 

Full 

Avg. 

Inv. 

Cost 

Rate 

(S/mo) 

Joint 

1 

19.0 

21.0 

10 

22 

0.116 

0.0019 

5.86 

27.17 

Ind. 

10 

21 

0.118 

0.0023 

5.70 

27.16 

Joint 

2 

19.0 

21.0 

10 

22 

0.116 

0.0019 

5.86 

25.17 

Ind. 

10 

21 

0.118 

0.0023 

5.70 

25.16 

Joint 

3 

21.0 

19.0 

05 

16 

0.057 

0.0133 

6.41 

27.13 

Ind. 

05 

Li^ 

0.057 

0.0133 

6.41 

27.13 

Total  Cost 
Rate 

CPU 

(sec) 

Joint 

78.744 

122.22 

Ind. 

78.749 

0.49 

The  important  feature  of  these  results  is  that,  for  these  data,  individual  optimization  does 
just  about  as  well  as  joint  optimization,  and  the  differences  in  the  operating  characteristics  are 
small.  Furthermore,  individual  optimization  is  done  in  a reasonable  amount  of  computation 
time  and  joint  optimization  is  not. 

Note  that  when  the  disconnect  rate  is  much  smaller  than  the  connect  rate,  e.g.,  facility  1 
in  examples  1 and  2,  two  of  the  alternate  optima  for  individual  optimization  are  displayed. 

5.  THE  TIME-ZERO  RETURN  PROBLEM  AND  A 
DYNAMIC  PROGRAMMING  FORMULATION 

Now  we  turn  to  the  problem  of  the  decision  to  be  made  when  the  stock  control  system  is 
first  implemented.  We  call  this  the  time-zero  return  problem.  Since  we  are  interested  in 
optimizing  the  first  decision,  it  is  natural  to  study  this  problem  with  dynamic  programming;  in 
particular,  we  will  use  the  structure  of  a Markov-renewal  program  (7).  Besides  solving  the 
problem  at  hand,  this  formulation  will  lead  to  further  results  for  the  infinite-horizon  model  stu- 
died in  the  previous  sections. 

The  time-zero  return  problem  is  simply  stated.  Suppose  at  time  zero  the  inventory  level 
is  k and  the  cost  of  returning  n items  at  time  zero  is  /?o  + rgn:  how  many  items  (<  k)  should 
be  returned  to  minimize  the  long-run  costs?  We  allow  R and  ro  ^ r,  but  it  is  not  neces- 

sary. 

S.l  Dynamic  Programming  Formulation 

In  this  section  we  formulate  the  model  as  a Markov-renewal  decision  problem.  This  will 
be  the  cornerstone  of  our  analysis  for  the  decision  to  be  taken  at  time  zero. 

Let  an  event  be  a connect  or  disconnect.  Since  the  connects  and  disconnects  occur  accord- 
ing to  independent  Poisson  processes  with  rates  /x  and  X respectively,  the  time  between  events 
has  a negative-exponential  distribution  with  mean  and  the  probability  that  the  next 


t 
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event  is  a disconnect  is  X/(X+/it).  Since  the  negative-exponential  distribution  has  the 
memoryless  property,  there  is  no  loss  in  generality  of  considering  the  system  only  at  event 
epochs.  Let  the  stare  of  the  system  at  any  time  i > 0 be  the  inventory  level  at  time  i. 


Let  be  the  mean  length  of  time  the  system  stays  in  state  i before  changing  state,  p, , be 
the  probability  that  the  next  state  entered  is  7,  given  that  the  current  state  is  /,  and  d,  be  the 
expected  cost  incurred  during  a visit  to  state  /.  To  facilitate  solving  the  model’s  equations  it  is 
convenient  to  introduce  a maxiniun  inventory  level;  this  does  not  affect  our  previous  results 
nor  the  theory  behind  the  following  results.  Let  A/  < 00  be  the  maximum  state  that  can  be 
entered;  we  can  interpret  N as  the  size  of  the  storage  facility.  Assume  we  use  the  policy  (a,b) 
to  make  decisions  in  those  states  < b and  the  decision  to  return  nothing  in  those  states  / where 
A -E  1 < / < N—\,  and  disconnects  that  arrive  when  the  inventory  is  at  level  N are  freely 
disposed  of.  Then, 


(5.1) 


(5.2) 


1/X,  /-O. 

MX+fi),  /“1, 2 b-\.b  + \ N-\ 


0, 


i~b, 


1/m.  (“M 


d, 


fis/X,  /— 0, 

hi/(k+pL).  /-I. 2 A-l. 6-1-1 N-\ 

R \-(b—a) r,  i—b, 

HNliJi,  i~N, 


(5.3a) 


ft/ 


|x/(X-t-/i)  ^ p,  J — i + I, 
Im/(X-ai)  ^ (I.  7 “ / - 1. 


for  1 < / *$  A/-I,  6, 


(^■3b)  p„i  ••  pi,^  — Pn.n-\  “ I. 

and  p, , “ 0 otherwise.  It  is  clear  that  the  policy  (0,6)  yields  the  single  recurrent  set  of  states 
|0, 1,  . . . ,b\  when  X > 0.  When  6 > 0,  then  b*  can  be  given  an  a priori  finite  upper  bound 
and  we  assume  b*  ^ N—\.  Thus,  none  of  the  pathologies  of  Markov-renewal  programs  can 
haunt  us. 


Let  v*(/)  be  the  total  cost  incurred  by  time  t when  the  inventory  at  time  zero  is  k.  Jewell 
(7,  eq.  (2)1  shows  that,  for  large  r, 

(5.4)  v*(f)  - O -E  w*  -t-  o(r),  k -0,1 iV 

where  C is  the  cost  rate  (as  before,  which  is  independent  of  the  initial  state  if  there  is  only  one 
recurrent  set  of  states)  and  — w^(a,b)  is  the  bias  in  the  total  cost  from  starting  in  state  k. 
Let  w‘ — w,(a*,b*). 

For  a stationary  policy  (i.e.,  one  that  depends  only  on  the  state)  that  results  in  a Markov 
chain  with  transition  matrix  (p, ,)  with  a single  finite  set  of  recurrent  states,  we  have  (7,  eq. 
(12)1 

(5.5)  »>,  + Ci>, d,  + "^pjw, 


f ; 


! 


I 


I 


I 

I 


for  all  states  /. 
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Now  that  we  have  formulated  the  model  as  a structured  dynamic  program,  we  will  show 
why  it  is  useful  for  our  current  problem. 

5.2  Relation  to  our  Current  Problem 


The  key  to  the  solution  to  our  current  problem  is  (5,4).  Since  the  decision  at  time  zero 
cannot  affect  the  cost  rate,  the  best  one  can  do  is  affect  the  bias  terms.  If  at  time  zero  the 
inventory  level  is  A and  /j  > 0 items  are  returned,  the  cost  incurred  by  time  /,  using  the 
optimal  policy,  is 

(5.6a)  Ri^(n)  + nrf)+  C*t  + + o(r), 

where  Sin)  is  the  unit  step  function.  Expression  (5.6a)  is  minimized  by  minimizing 
(5.66)  /*('>)  A R(iS(n)  + nrQ+ 

and  the  value  of  n,  say,  that  achieves  the  minimum  return  quantity  for  inventory  level  k at 
time  zero  is  the  optimal  number  of  items  to  return  when  starting  in  state  k.  Our  remaining 
work  is  to  calculate  the  w/’s. 

5.3  Obtaining  the  Bias  Terms  and  l\in) 

We  shall  divide  our  task  of  obtaining  J\in)  into  three  parts.  First,  let  us  consider  states 
zero  through  b inclusive.  The  h + 1 equations  given  by  (5.5)  contain  6+2  unknowns,  includ- 
ing C.  But  we  can  find  f *by  the  methods  of  section  2,  so  suppose  we  have  done  so.  Now  we 
have  only  6 + I unknowns.  Let  us  write  our  system  of  equations  in  matrix  form;  using  an 
obvious  notation  we  obtain,  for  any  policy  (a, 6), 

IV  + C If  — d + Rw 

or 

(5.7)  H - R)w_^- d^-  Ci^. 

But  F is  a stochastic  matrix,  so  / - F has  no  inverse,  indicating  that  at  least  one  of  the  6 4 1 
equations  is  redundant.  From  (5.3)  or  by  physical  reasoning  one  can  conclude  that  F consists 
of  one  closed  class  of  states  From  the  discussion  in  [7,  p.  9551  we  know  there  is  exactly  one 
redundant  equation  and  any  one  of  the  equations  can  be  discarded.  We  will  choose  the  6"' 
equation  as  the  redundant  one.  To  reduce  the  number  of  variables  by  one  define 

(5.8)  V,  - w,  - Wfl, 

so  that  by  replacing  w,  by  v,  in  (5.5)  and  using  (5.1)  — (5.3)  we  obtain 
(5.9a)  Vo“0. 

(5.96)  dk  - ixsik  + V|, 

(5.9t  ) V,  + dik+fi)  — hi/ik-\-n)  + /7v,^i  + ( “ 1,2 6 — 1. 

This  system  has  a unique  .solution  for  the  relative  bia.ses  v,  and  (5.9c)  can  be  used  to  recur- 
sively compute  y for  / - 2 6. 


One  can  obtain  a closed  form  expression  for  the  v,  by  regarding  (5.9c)  as  a second-order 
difference  equation  with  boundary  conditions  (5.9a)  and  (5.9b).  Using  standard  methods  (see, 
e.g.,  (6,  sections  1.7  and  1.8l)  we  obtain,  for  k ^ n. 


“ «(p  — 1 ) + lit  + y^,  (“0,1 6, 


(5.10) 


V, 
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where 

(5.11fl) 

a = 

(5.116) 

p = 

and 

(5.11c) 

y = 

C/(\—n)  + /j  (X +^t)/[2(A  — 
y = — hf[2(K  — /x)l. 

When  > = n,  the  solution  to  the  difference  equation  can  also  be  found  by  the  same  methods. 
Since  (5.10)  holds  for  any  policy  (a, 6),  it  must  hold  for  the  optimal  policy  (a*,b*). 

To  demonstrate  that  the  (("'equation  is  indeed  redundant,  we  can  substitute  (5.10)  into  it 
and  show  that  we  obtain  an  identity.  The  (("'equation  is 

(5.12)  Vi,  = R + rib— a)  + 

substituting  (5.10)  into  (5.12)  and  solving  for  C we  obtain 

(5.13)  C = ^nToS  + X7rft_|(R  + r(((— a)l  + hL 

where  tto,  7r,,_|  and  L are  given  by  (3.16),  (3.18),  and  (3.22),  respectively.  TjJlus  (5.13)  above 
and  (2.1)  are  identical,  so  (5.13)  is  an  identity.  Notice  that  this  development  emphasizes  the 
fact  that  ( cannot  be  picked  arbitrarily,  but  that  it  must  be  given  by  (5.13)  in  order  for  (5.7)  to 
be  a consistent  system  of  equations. 

The  second  step  in  obtaining  J\in)  is  to  consider  those  states  larger  than  b.  These  states 
are  transient,  because  once  a stale  k,  say,  with  k > f(,  is  entered  no  state  larger  than  b is  ever 
revisited,  and  such  a state  k will  be  entered  in  finite  time  by  a simple  random  walk  analysis. 
Thus  the  return  decision  used  in  these  states  cannot  affect  the  cost  rate.  Since  we  are 
interested  in  comparing  the  effects  of  different  return  quantities  at  time  zero,  it  is  fruitful  to 
compute  the  bias  terms  for  these  states  when  the  decision  used  in  these  states  is  to  return  noth- 
ing, as  we  have  previously  assumed. 

Let  (V,  be  the  bias  term  for  state  /,  and  define 

(5.14)  y,  “ w,  - w^. 

When  no  returns  are  made  in  state  /,  using  the  same  arguments  leading  to  (5.9),  we  obtain 
(5.15u)  y.v  “ 0> 

(5.156) 


('  hN  ^ 

+ yv  i. 

M-  M 


and 

(5.15c) 


y,  + + i I’  ' “ (’  + >.(>+2.  . A'-l- 

K+fj,  K+fi 


Equations  (5.15)  can  be  solved  in  the  same  way  that  (5.9)  were  solved.  The  difference 
equation  solution  is 

(5.16)  y,  - ^2  + ' 

for  p ^ I,  whe'e  fi  and  y are  as  before, 

C+hN 


- - 


-t-/3  -t-  y[(V2-  ((V-l)^l 


(1  - p), 
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and 

I,  - + ^}((p-l)/V  + ll  + y(p^^-  (/v-l)^lj  j (1-p). 

We  have  now  obtained  v„  / — 0. 1 . . . , b,  and  i — h,b  + \,  . . . , A/’;  let  u&  now  extend 

the  domain  of  definition  of  y,  to  / — 0, 1 b — 1.  By  definition  v,  — w,  — wq  and 

y,  - w,  - wv  for  all  /,  so 

(5.17)  - v,  - w,,  - wv  = yo- 
In  particular,  (5.17)  holds  for  / — b,  so 

(5.18) 

hence  >>0  can  be  obtained  from  our  previous  results,  and  so  can  y,  by  use  of  (5.17). 

Let  us  review  what  we  have  just  done.  First  we  found  the  bias  terms  relative  to  wq  for 
i-0, 1,  ...,  tr,  these  are  the  v,’s.  Next  we  found  the  bias  terms  relative  to  for 

/ =“  b,b  + l /V;  these  are  the  y,  's.  Then  tve  used  (5.17)  and  (5.  IS)  to  convert  the  v,  's  into  y,  's 

for  /'  =•  0, 1,  . . . , b.  Now  we  study  fi,(n). 

From  (5.6b)  and  (5.14)  we  obtain 

/*(n)  - R^p(n)  + nr„  + yt  „ + w’^. 

where  >>*  is  j'*  evaluated  for  the  optimal  policy.  Even  though  we  do  not  know  it  is  only  a 
constant  in  /*(«),  hence  /*(«)  can  be  minimized  over  n by  minimizing 

(5.19)  JtgSin)  + nro  + y^’-n,  ' 

and  doing  so  one  will  find  the  optimal  return  quantity  when  the  inventory  level  is  k at  time  j 

zero.  Once  the  >',*’s  have  been  obtained,  a search  of  A:  + 1 numbers  given  by  (5.19)  will  yield  j 

the  optimal  return  quantity.  Thus  we  have  obtained  a simple  algorithm  to  solve  our  problem.  , 

When  there  are  alternate  optima  for  a and  6,  it  is  important  to  know  w^  because  it  and  ' 

(and  hence  the  optimal  return  quantity)  will  depend  on  the  particular  policy  followed.  f 

Je«>'ell  [8]  shows  how  to  obtain  w/^  with  simple  calculations. 

5.4  The  Explicit  Solution  for  an 
Import  .nt  Special  Case 

An  important  special  case  of  the  model  occurs  when  the  movement  costs  are  the  same  at 
all  times,  i.e.,  when  and  Zq  — r.  For  this  case  we  can  partially  characterize  the  optimal 

policy  at  time  zero.  Let  k be  the  initial  inventory  and  /j"  be  the  (optimal  inventory  level  after  an 
initial  return.  We  define  the  function  /by 

/,  - y‘  - ri.  /■  - 0, 1 N. 

From  (5.19)  we  see  that  //minimizes 

(5.20)  RSik-i)  + /, 

for  / ” 0,1 N\  hence,  either  /'/  ••  At  or  //  minimizes  /,  for  i < k.  A partial  characteriza- 

tion of  //is: 
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THEOREM. 

(I)  if  Ar  < b*,  return  nothing,  i.e.,  /**  - k\ 

(II)  if  Ac  = b\  return  down  to  a*,  i.e.,  “ a* 

(III)  if  Ac  > A* and  /,  > /*.  for  all  / > b*,  then 

return  down  to  a*,  i.e.,  — a*. 

As  usual,  a*  and  b*  denote  the  optimal  infinite  horizon  policy,  so  the  infinite  horizon 
solution  partially  provides  the  solution  for  the  returns  problem  at  time  zero. 

To  prove  part  I of  this  theorem,  we  start  by  observing  that  since  (a*,A»*)  is  optimal  for  the 
long  run,  any  state  i,  0 ^ ; < h*,  is  recurrent  (i.e.,  will  be  entered  infinitely  often)  and  if 
returning  a positive  amount  were  optimal  the  first  time  / was  entered  it  would  improve  the  cost 
rate  if  this  amount  were  returned  every  time  state  / were  entered  (this  statement  is  called 
"Howard’s  Improvement  Routine"  and  is  rigourously  established  in  (13,  Theorem  9.21).  But 
this  means  that  (a*,b*)  is  not  optimal,  which  is  a contradiction.  Thus,  returning  nothing  must 
be  optimal  when  the  initial  inventory  is  less  than  b*.  A similar  argument  proves  the  result  for 
/ * b\  i.e.,  part  11.  Note  that  from  (5.12)  we  are  actually  indifferent  between  returning 
b*  — fl*and  nothing  when  the  initial  inventory  is  b*. 

To  prove  part  III,  note  that  part  I implies  a* minimizes  f,  for  / < b*-,  but  since  f,  > 
a*  minimizes  /for  all  /;  hence,  either  /i.* -=  a* or  /i,’  = k.  From  (5.20)  we  see  that,  to  show  that 
a* is  optimal,  we  must  show  that  /?  + /o-  < /*.  Equation  (5.12)  can  be  written  R + 
this,  along  with  the  hypotheses  of  part  III,  yields 

/„•- 

which  completes  the  proof  of  part  III. 

6.  FURTHER  CONSEQUENCES  OF  THE  DYNAMIC 

PROGRAMMING  FORMULATION 

In  this  section  we  will  show  how  the  dynamic  programming  formulation  of  section  5.1  can 
be  used  to  prove  the  optimality  of  the  two-critical-number  policy  we  studied  in  section  3 and  to 
provide  a test  of  the  global  optimality  of  the  solutions  found  by  nonlinear  programming. 

6.1  Optimality  Properties  of  the 

Two-Critical-Number  Policy 

We  will  prove  that  among  the  class  of  stationary  policies,  there  is  an  optimal  policy  of  the 
form  (a,b).  We  start  by  proving  that  an  optimal  stationary  policy  will  make  returns. 

PROPOSITION;  There  is  a finite  state  / such  that  an  optimal  stationary  policy  insists  on 
making  a return  in  state  /,.. 

PROOF;  Suppose  there  were  no  such  state.  Then  returns  would  never  be  made  and  the 
inventory  level  would  fluctuate  exactly  as  the  number  of  customers  present  in  an  A//A//1 
queue,  (see,  e.g.,  (9)  for  the  pertinant  queueing  results).  From  standard  queueing  theory  it  is 
known  that  if  p > 1,  then  /,-<»,  which  is  clearly  not  optimal  because  we  constructed  policies 
with  a finite  cost  rate  in  section  3,  so  /<.  exists  when  p > 1.  For  p < 1,  when  an  optimal  policy 
is  followed,  let  C*be  the  cost  rate,  f/be  lime  required  for  the  inventory  to  drop  from  / to  /-I, 
and  g*  be  the  costs  incurred  by  the  connects  and  disconnects  that  occur  while  the  inventrory 
drops  from  / to  /—I.  From  (5.5)  we  obtain 
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(6.1)  H-;-  hi  + £(g‘)  - c»  e(t;)  + 

for  an  optimal  policy. 

From  the  theory  of  the  Ml  MIX  queue,  EiT')  is  the  expected  length  of  a busy  period,  so 
E(X)  - (/i  - X)-',  / - 1,2 

Choose  a state  j so  large  that 

(6.2)  ihj  - C*)nti  -\)  + Eig")  > R + r. 

From  (6.1)  and  (6.2)  we  see  that  w‘  - w,',|  is  larger  under  the  assumed  optimal  policy  that  it 
would  be  if  one  item  were  returned.  Howard’s  Improvement  Routine  then  asserts  that  the 
assumed  optimal  policy  can  be  improved,  so  that  a contradiction  is  established,  and  the  state  j 
in  (6.2)  is  an  upper  bound  on  i,. 

From  this  proposition  it  follows  that  if  the  inventory  at  time  zero  is  no  larger  than  /, , no 
state  larger  than  /,  will  ever  be  reached.  If  the  inventory  at  time  zero  is  larger  than  /,  , either 
state  ( or  a smaller  one  will  be  instantaneously  reached  by  an  initial  return,  or  state  /;  will  be 
reached  in  finite  time  by  the  evolution  of  the  inventory  level  when  an  optimal  policy  is  fol- 
lowed. Let  6*  be  the  smallest  state  where  the  optimal  decision  in  that  state  is  to  return  a posi- 
tive amount,  and  let  6*  — a*  be  the  optimal  return  quantity.  Then  when  > 0 (as  it  is  for 
any  nontrivial  problem)  state  b*  will  be  reached  by  some  finite  time,  and  thereafter  no  state 
larger  than  6*  will  be  reached.  Thus,  6*  and  a*  will  determine  the  minimum  cost  rate  and  the 
decisions  in  those  states  between  6*  -I-  1,  and  , (a*,b*)  is  optimal  among  all  stationary  poli- 
cies. 

This  result  can  be  strengthened  to  include  the  class  of  all  policies  when  the  maximum 
inventory  level  N is  finite  and  larger  than  In  this  case  the  Markov-renewal  program  has  only 
a finite  number  of  states  with  only  a finite  number  of  choices  in  each  stale,  so  it  has  an  optimal 
policy  that  is  stationary  [7). 

6.2  Global  Optimality  Conditions 

In  section  3 we  found  a*  and  6*  with  a nonlinear  programming  algorithm.  Since  we  did 
not  prove  that  the  function  being  minimized  is,  in  general,  convex,  we  have  no  guarantee  that 
the  algorithm  will  always  find  the  global  minimum.  We  can  use  the  formulation  of  section  5.1 
to  devise  a test  that  will  determine  if  a given  pair  (a.b)  is  optimal. 

Equation  (5.13)  shows  that  the  cost  rates  studied  in  sections  3 and  5 are  identical,  so  the 
solution  found  by  the  nonlinear  programming  algorithm  is  a global  optimum  if,  and  only  if,  it  is 
an  optimal  solution  to  the  dynamic  program. 

Suppose  we  have  a policy  (a.b)  that  we  think  is  optimal.  Associated  with  this  policy  is  a 

cost  rate  C and  a set  of  relative  bias  terms  y„i  —0,1 N.  Let  v,*"’,  </,*"’  and  p/"’  be  the 

expected  duration  of  a visit  to  state  /,  the  expected  immediate  cost  of  a visit  to  state  /,  and  the 
conditional  probability  that  the  next  state  will  be  y,  when  n items  are  returned  in  state  /,  respec- 
tively, /.  y » 0, 1 N and  « — 0, 1 /. 

According  to  Jewell’s  Markov-renewal  programming  algorithm  (71,  the  policy  (a. 6)  is 
optimal  if,  and  only  if, 

(6.3)  [rf/"’  -I-  Ip/r'ji/  - y,]  j v/"'  > C.  n - 0. 1 /. 

for  all  /. 
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so 

If  we  take  N — 50,  (6.1)  would  have  to  be  tested  ^ / — 1275  times  for  each  {a,b) 

thought  to  be  optimal,  with  6 < 49  (policies  with  6 — 50  may  not  be  globally  optimal  and  may 
just  indicate  that  h*  > 50  and  that  the  chosen  bound  of  50  should  be  increased).  Each  test 
involves  only  three  multiplications,  three  additions,  and  one  comparison  once  the  parameters 
are  supplied. 


If  a policy  fails  the  test  there  are  several  courses  of  action  open.  If  the  nonlinear  pro- 
gramming algorithm  that  is  used  is  of  the  kind  that  different  starting  values  may  produce 
different  local  optima,  another  starting  value  could  be  tried  and  tested,  these  iterations  per- 
formed until  the  test  is  passed.  A second  alternative  is  to  solve  the  Markov-renewal  program 
by  using  the  algorithm  in  [7],  for  example.  The  linear  programming  formulation  given  below 
provides  a third  algorithm  for  obtaining  the  global  optimal.  Although  the  linear  and  dynamic 
programs  are  mathematically  identical,  the  availabilty  of  computer  codes  may  make  one  of  the 
two  formulations  easier  to  use. 

6.3  Relation  to  Linear  Programming 

it  is  known  that  Markov-renewal  decision  problems  can  be  formulated  as  linear  programs 
and  the  dynamic-programming-based  algorithms  are  the  simplex  algorithms  with  certain  block 
pivoting  rules  (see,  e g.,  [8]).  Thus,  (6.3)  is  just  the  statement  that,  for  a vector  to  be  optimal 
in  an  LP,  all  columns  must  price  out  nonnegative. 


The  LP  formulation  of  our  problem  is 

» 

minimize  x^y  + i 


subject  to  ^ 8,/  ~ P/r’  ^1  + •'/"’•’f.v+i  ^ 


for  / - 0, 1 iV  and  rt  — n,  — 0, 1 , . . . , /,  where  6, , - 1 if  / - 7 and  is  zero  otherwise.  In 

the  LP,  jTvfi  takes  the  role  of  the  cost  rate  C,  and  x,  takes  the  role  of  the  bias  term 

,v 

w,,  7—0,1 N.  Thus,  the  LP  contains  N + 1 unrestricted  variables  and  ^ (/-El)  ine- 


quality constraints. 


i-Q 


51 

For  /V  - 50,  this  LP  has  ^ — 1325  constraints.  Since  each  of  the  A -E  1 unrestricted 

variables  has  to  be  expressed  as  two  nonnegative  variables,  and  slack  variables  have  to  be  added 
to  each  inquality  constraint  to  apply  the  simplex  method,  this  LP  will  have  1428  nonnegative 
variables.  It  does  not  appear  that  the  LP  formulation  will  work  as  fast  as  the  nonlinear  formu- 
lation, which  has  only  two  variables  and  one  simple  constraint.  It  is  interesting  to  observe  that 
the  problem  of  obtaining  the  optimal  long  run  policy  can  be  formulated  as  either  a nonlinear, 
linear,  or  dynamic  program. 
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ABSTRACT 

This  paper  considcr'i  real-lime  decision  rules  for  an  inventory  system  where 
items  are  repaired  than  "used  up  " The  problem  is  to  decide  which  user  in  the 
system  has  the  greatest  need  for  the  newly  available  inventory  items  coming 
out  of  repair  The  main  result  shows  that  two  published  approahes.  the  Tran- 
sportation Time  Took  Ahead  policy  and  MKTRIC.  are  optimal  when  the 
number  of  users  gels  large  A useful  byproduct  of  the  proof  is  a lower  hound 
on  the  average  backorder  rale  for  a repair-inventory  system  of  any  si/e 

1.  INTRODUCTION  AND  SUMMARY 

We  consider  a repair-inventory  system  where  the  inventory  items  are  repaired  when  they 
fail.  This  contrasts  in  a significant  way  with  the  standard  inventory  model  in  which  a failure  (or 
demand)  means  that  the  item  is  used  up  and  replaced  by  a new  item.  In  the  U.S.  Air  Force 
inventory  system  which  motivates  this  study,  well  over  half  of  the  multibillion  inventory  value 
is  in  repairable  items. 

•Study  of  a repair-inventory  system  consists  of  two  problems:  (1)  How  should  the  inven- 
tory items  be  utilized  in  a .system  of  a repair  center  and  m consumption  centers  (the  distribu- 
tion problem)?  (2)  How  many  items  should  be  purchased  (the  procurement  problem)?  Our 
analysis  is  concerned  with  the  distribution  problem. 

The  model  we  will  analyze  has  been  investigated  by  several  authors,  and  a survey  of  the 
earlier  work  is  included  in  the  paper  of  Simon  [5].  The  most  recent  work  in  the  distribution 
problem  is  that  of  Miller  (2|  which  is  described  in  the  next  paragraph.  The  METRIC  model  of 
Sherbrooke  |4)  is  still  the  standard  for  the  procurement  problem.  The  METRIC  model  is  also  a 
distribution  procedure,  and  this  paper  ends  with  some  positive  results  on  METRIC  for  the  dis- 
tribution problem. 

•This  research  was  pariially  supporicd  by  the  Nalional  Science  Foundalion  under  F.NO  74-13494  and  F.NO  76-12230. 
Musi  of  ihc  resulls  in  Ihis  paper  came  from  the  Ph  D disserlalion  of  Mohammad  Modarres-Ya/di  enlilled.  "Myopic 
ITecision  R.des  in  Ihc  Opiimi/alion  of  Queueing  and  Inventory  Models." 
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The  decision-making  in  the  distribution  problem  is  concerned  with  answering  the  ques- 
tion. "To  which  consumption  center  should  we  ship  the  newly  repaired  item?"  Our  proposed 
solution  to  the  distribution  problem,  the  Transportation  Time  Look  Ahead  policy,  was  intro- 
duced in  the  RAND  report  (1).  It  was  presented  as  a heuristic  policy  which  determines  the 
"neediest"  comsumption  center  (Air  Force  base),  and  sends  the  newly  repaired  item  to  that 
center  from  the  repair  center.  This  policy  was  tested  by  simulation,  and  it  resulted  on  the  aver- 
age in  20  percent  lower  backorder  rate  than  the  METRIC  model  of  Sherbrooke  (4].  Further- 
more, it  was  less  computationally  demanding.  Calculating  an  optimal  solution  to  the  distribu- 
tion problem  is  impossible,  since  formally  it  is  a dynamic  programming  problem  with  many 
state  variables,  representing  the  inventory  level  at  the  consumption  centers,  the  number  of 
items  en  route  to  the  consumption  centers,  the  number  of  items  in  repair,  and  the  number  of 
items  en  route  to  the  repair  center  from  the  consumption  centers. 

A theoretical  justification  for  the  Transportation  Time  Look  Ahead  policy  was  obtained  by 
Miller  12).  It  was  shown  to  be  optimal  for  a modified  version  of  the  problem  whose  main 
assumption  was  that  there  is  no  repair  at  all,  that  is,  every  failure  at  a consumption  center  is 
matched  by  a repaired  item  simultanefously  appearing  at  the  repair  center.  Physically,  this  is 
very  far  from  reality,  of  course,  but  mathematically  it  represented  a decision  problem  similar  to 
the  true  system.  In  this  paper  we  will  show  that  a modified  version  of  the  Transportation  Time 
Look  Ahead  policy  is  optimal  for  the  true  system  when  the  number  of  consumption  centers 
goes  to  infinity.  The  criterion  for  optimality  is  the  average  backorder  rate  per  unit  time  per 
consumption  center  in  an  infinite  horizon. 

The  proof  of  optimality  is  carried  out  by  first  obtaining  a lower  bound  for  the  average 
backorder  rate  using  an  arbitrary  policy.  This  bound  does  not  depend  on  the  size  of  the  system, 
and  therefore  it  is  useful  in  itself  as  we  can  compare  the  backorder  rate  of  a given  policy 
obtained  by  simulation  with  this  lower  bound.  The  second  part  of  the  proof  entails  showing 
that  the  Transportation  Time  Look  Ahead  policy  approaches  this  bound  as  the  number  of  con- 
sumption centers  goes  to  infinity.  This  proof  is  also  used  to  show  that  the  METRIC  distribu- 
tion model  is  optimal. 

2.  THE  MODEL 

The  model  will  be  described  in  general  (non-Air  Force)  terms  for  a single  inventory  item 
The  system  has  a repair  center,  m consumption  centers,  and  N spare  items. 

Each  consumption  center  supports  activities  which  produce  failures  of  the  item  according 
to  a Poisson  rate  X,  of  items  per  day.  This  is  assumed  to  be  the  case  even  when  the  inventory 
shortage  hinders  these  activities,  since  it  is  assumed  that  the  available  units  (aircraft  in  the  Air 
Force  example)  can  be  utilized  at  a higher  rate  to  compensate  for  the  shortage.  In  addition  to 
the  parameter  X„  a second  parameter  is  needed  to  describe  each  consumption  center,  namely 
r„  the  one-way  deterministic  shipping  time  between  the  consumption  center  and  the  repair 
center.  In  order  to  limit  the  detrimental  effect  of  these  inventory  shortages,  the  system 
employs  A spare  units  which  are  u.sed  to  replace  operational  units  when  they  fail. 

When  a unit  fails  at  consumption  center  /,  a spare,  if  available  at  the  consumption  center, 
is  used  to  replace  the  inoperative  unit.  If  no  spare  is  available  at  consumption  center  /,  then  a 
backorder  occurs.  The  failed  unit  is  sent  to  the  repair  center  at  the  instant  of  failure,  and  the 
shipping  time  is  T,.  The  repair  center  repairs  the  items  according  to  a general  distribution  func- 
tion F whose  mean  is  Mp..  There  is  no  queueing  or  other  interaction  between  items  at  the 
repair  center,  so  that  the  repair  center  can  be  thought  of  as  an  infinite-server  queueing  system. 
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When  (he  item  comes  out  of  repair,  we  must  decide  whether  to  keep  it  (temporarily)  in  reserve 
stock  at  a repair  center,  or  to  ship  it  to  consumption  center  /,  l</<m.  The  items  which  are 
held  in  reserve  at  the  repair  center  may  be  sent  at  any  time  to  a consumption  center. 

KXAMPLE  I:  Let  w - 3,  \ = 0. 1 .T,  - 10  days,  X2-0.02.7'2-  3 days, 
A , - 0 3,  r,  - 6.67  days,  A be  exponential  with  a mean  of  7 days,  and  A — 9.  Lei  each  base 
have  3 items  al  lime  0.  A simulation  of  the  system  is  shown  in  Table  1. 


TABLE  I 


Time 

Event 

Consequence 

0.15 

Failure  at  C.C.  3 

Inventory  level  drops  to  2.  Item  will 
arrive  at  repair  center  at  6.82. 

2.37 

Failure  at  C.C.  2 

Inventory  level  drops  to  2.  Item  will 
arrive  at  repair  center  at  5.37. 

4.76 

5.37 

Failure  al  C.C.  3 

Arrival  at  repair 
center 

Inventory  level  drops  to  1.  Item  will 
arrive  al  repair  center  al  1 1 .43. 

6 14 

6.72 

Failure  al  C.C.  1 

Completion  of  repair 
at  repair  center. 
Repaired  item  is 
sent  to  C.C.  3 where 
it  will  arrive  at 
lime  13.39. 

Inventory  level  drops  to  2.  Item  will 
arrive  at  repair  center  at  16.14. 

Let  v,(/)  be  inventory  level  al  consumption  center  / at  time  t.  Our  objective  is  to  find  a 
decision  rule  which  minimizes 

(I)  - — lim  — flT  f s,{w)~dw 

where  x - max(-Ar.O)  and  A' stands  for  expected  value.  The  average  backorder  rate  is  B,  and 
It  depends  on  ihe  initial  slate  as  well  as  the  policy  used. 

Let  x,{i)  be  the  sum  of  s, (r)  plus  the  number  of  items  en  route  to  consumption  center  /. 
Rather  than  use  the  objective  function  (I),  it  is  preferable  to  perform  a time  translation  as  was 
done  in  (2|  and  let  c, (x,  (/)|  be  Ihe  expected  inventory  shortage  at  consumption  center  / at  time 
i + T,,  given  that  the  inventory  level  at  time  i is  x,  (r).  This  term  is  well-defined  since  any  deci- 
sions made  after  lime  i will  influence  shortages  at  consumption  center  / only  after  time  i+T,. 
Thus  c,[x,(r)|  — £'(i,(/+ /;)■),  and  the  objective  function  in  terms  of  c,|x,(/))  is  to  minimize 


I 


I 

i 

! 


I 

I 


I 

A' 
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(2) 


Since  s,{i  + T,)  - where  D,  has  a Poisson  distribution  with  a mean  of  \ T we 

can  calculate  c,[a;(/))  by  ' 


(3) 


c,U,(r)]  = 2^  [.i-x,U)\  p(j\k,T,). 

I *max|  v// ) .01 


where  i){i\k)  - e * A'//!,  the  probability  that  a Poisson  random  variable  with  mean  k equals  i. 

We  conclude  this  section  with  a section  with  an  important  observation  about  the  model. 
Let  U(i)  be  the  number  of  items  en  route  to  the  repair  center  and  in  repair  at  time  r,  the  sym- 
bol U is  supposed  to  suggest  "unavailable."  This  number  is  independent  of  all  of  our  past  (and 
of  course  future)  decisions  of  where  to  send  the  repaired  item.  Define  /1(r)  to  be  the  number 
of  repaired  items  held  at  the  repair  center,  plus  the  items  which  are  en  route  to  the  consump- 
tion centers,  plus  the  inventory  levels  at  the  consumption  centers;  the  symbol  A is  used  to  sug- 
gest available.  Then  /I  (t)- number  of  items  held  at  the  repair  center.  The  basic 
accounting  equation  is  = N. 

THEOREM  I:  Let  the  initial  condition  be  U(0)  = 0.  (i)  For  r>  0,  U(t)  has  a Poisson 
distribution  with  an  increasing  mean  of  where  k=T  k Pit)  - - f Y 

Y (1-G,(w)]rfw,  and  G,  is  the  distribution  function  of  T,  plus  the  repair  time 

Ue..G,(t)  - F(t-T,)].  0/)U  = lim  EUit)  k,iT,  + 1/^),  where  1/m  is  the  mean  of  the 
repair  time. 

PROOF.  The  result  follows  with  minor  modification  from  a known  result  with  an  ir, finite 
server  queue  (see  Ross  I3,p.l81  for  example). 

COROLLARY:  A = lim  El/4  (t))  exists. 


3.  THE  MAIN  RESULT  FOR  THE  DISTRIBUTION  PROBLEM 

This  version  of  the  Transportation  Time  Look  Ahead  policy  begins  with  the  marginal 
analysis  problem: 

min^  f,(jf,) 

1-1 

m ^ 

subject  to  ^ x,  “ A,  x,  an  integer. 
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The  c,  are  defined  by  (3),  and  -4  is  defined  by  the  corollary  to  Theorem  1.  (Since  /t  = N-U.X 
is  an  integer  only  if  (/  = ( 7;  + X/fi)  is  an  integer,  which  is  not  generally  the  case,  of 

. -I 

course.) 

j^n  this  section  we  will  choose  simplicity  over  generality  in  three  specifics.  We  will  assume 
that  /I  is  a nonnegative  integer.  If  this  is  not  done  wc  must  resort  to  terminology  such  as  the 
greatest  integer  less  than  A,  which  lengthens  all  the  arguments.  Second,  for  some  of  our 
results  we  will  assume  a convenient  initial  state  for  the  model.  Third,  we  will  let  the  system  get 
large  in  a convenient  way. 

The  problem  (4)  can  be  solved  by  a marginal  allocation  scheme  beginning  with 
= 0,  / = /,..., w,  since  the  c,  are  discretely  convex.  The  optimality  conditions  are  that 

<5)  max[c,(V)-c,(x,*  + 1)]  < min[c, (x;*-l)-c,  (x,*)) 


and  ^ X,  = A,  where  x,‘,  / = 1,  . . . , m,  are  the  maximizing  values. 

Example  2:  We  keep  the  same  parameters  as  Example  I and  calculate  xi'.xj,  and  xj.  As 
(/  = 7,/l  = 2 . The  optimal  allocation  is  x,*  - l.xj  = 0,  and  xj  =1.  It  can  be  verified  that 
max|c,(x,‘)-c,  (x,*  + I )J  = 0.594  for  / = 3 and  minfc,  (x,‘)-c,  (x,‘-l))  = 0.6221  for  / =1. 

' I 

The  Transportation  Time  Look  Ahead  policy  makes  decisions  when  an  item  comes  out  of 
repair  and  when  there  is  a failure  at  a consumption  center.  When  an  item  comes  out  of  repair, 
we  determine  the  consumption  center,  say  center  k,  which  maximizes  {c,(x,(r)l-c,(x,(/)  + lli 
over  consumption  centers  / such  that  x,(/)<x,,  and  send  the  item  to  consumption  center  k.  If 
•<■,(/)  ^ .V,  for  all  i,  then  the  item  is  held  at  the  repair  center.  The  term  myopic  can  be  applied 
to  this  rule  since  the  c,  are  the  immediate  cost  functions. 

When  a demand  occurs  at  consumption  center  /,  we  check  if  both  (a)  x,(r)<x,‘  and  (b) 
there  is  at  least  one  item  held  at  the  repair  center.  We  send  an  item  from  the  repair  center  to 
consumption  center  / if  and  only  if  both  these  conditions  hold.  Of  course,  if  both  conditions 
hold,  then  x,  (r)  > x,  for  all  j ^ i.  One  property  of  the  policy  is  that  if  x,  (0)  < x'  for  all  /, 
then  X,  (r)  ^ x'  for  all  / and  r. 

The  first  half  of  our  proof  of  the  optimality  of  the  Transportation  Time  Look  Ahead  pol- 
icy is  concerned  with  showing  that  for  all  policies  and  all  starting  states  satisfying  U{Q)  = 0, 

m 

the  average  backorder  rate  B is  greater  than  or  equal  to  f,  (jf,*)-  The  idea  of  the  proof  is  that 

(-1 

Theorem  1 ensures  that  regardless  of  the  policy  employed  the  long-run  average  of  items  avail- 
able is  A,  and  when  A items  are  available  the  minimum  value  of  the  cost  rate  is  ^ c, (x,’). 

/“I 

Let  r[x(r)l  - ^ f,lx,(r)]  and  C(x')  - c,(x,‘). 

-I  ,-i 

LEMMA  I:  Let  A be  any  value  between  the  right-hand  side  of  (5)  and  its  left-hand  side. 
If  A(i)  - A -)-  n,  then  C(x(/)i  ^ C(x')-n  A . 


602 


B.  L.  MILLER  AND  M MODARRES-YAZDI 


I 

I 


PROOF;  Consider  the  following  variation  of  (4) 

m 

min  z * ^ i*, 

/-I 

m ^ 

subject  to  ^ ■¥  n,  y,  an  integer. 

(-1 

m 

The  value  of  this  problem,  z\  will  be  less  than  or  equal  to  Cfjf(/)]  when  ^ 
_ 1-1 
x,(f )</!(/)  ~ A + n,  since  the  x,(t)  are  a feasible  solution  to  the  problem.  The  inequality 
z*  > C(x*)—n  A follows  from  the  discrete  convexity  of  the  c, . (Q.E.D) 

Let  C(t)  be  the  expected  value  of  C'lxlf))  conditioned  only  on  fy(0)=0, 

LEMMA  2;  For  any  decision  rule, 
f(f)  > C’(jf*)+  {A-EUii)]]^. 

PROOF:  C(f)  “X  CU(/)|/I(/)  - A + n]  PlAii)  = A + n\ 

n 

^ Lieu*)-  ^ n]  P\A{l)  = A + n] 

n 

by  Lemma  I 

= C(x*)+ MA-E[A(i)]].  (Q.E.D) 


THEOREM  2:  For  any  decision  rule  with  a starting  state  satisfying  (/(O)— 0, 

B ^ — C(x’). 
m 

PROOF.  By  the  corollary  to  Theorem  1,  given  any  e > 0 there  is  a i,  such  that  for  all 
!>  t ,,A-E\A(i)\  . We  have 


— lim  — £ r C{w)dw 

/ — oo  ^ *^0 

f 

\ ' 

— lim  — 1 r * eftf)  j/rv  -b  r Ciw)  d>v 

7 

1 

> lim  - f (C(x‘)  + A-EIAU)]  A )dw 

/ oo  ( t 

1 

1 

>C(x‘)  - € A . 

(Q.E.D) 

i * 

4 

The  second  part  of  the  proof  consists  of  showing  that  the  Transportation  Time  Look 
Ahead  policy  is  optimal  when  the  system  gets  large,  which  we  define  in  the  following  way. 

We  index  on  the  sequence  of  problems  which  are  getting  large  by  A,  where  A represents 
the  number  of  replications  of  m consumption  centers.  That  is,  for  problem  A there  is  one 
repair  center  and  km  consumption  centers,  with  A consumption  centers  described  by  the  param- 


RECOVERABLE  INVENTORY  ITEMS 


603 


I • 

elers  ( A , T^),k  consumption  centers  described  by  the  parameters  ( A 2,7”;^,  • • . , and  k con- 
sumption centers  described  by  the  parameters  . We  let  , and  B*  refer  to 

those  terms  as  applied  to  problem  A,  so  that  i/'  = = N.a'  = A,  and  S'  =»■  fl  as  A = I 

represents  the  origmal  problem. _We  set  kN.  _Jhe  replication  characteristics  of  problem  A 
imply  that  f/*  - kU.  Therefore  -4*  - k(N—U)  = kA. 

By  Theorem  2 the  value  of  AwB*  is  greater  than  or  equal  to 

t 

A m I 

min  X,  5^  : 

/-I  .-I 

subject  to  ^ ^ = kA,  a„  an  integer, 

> / 

where  c,  ,(•)  - c;  for  / = 1 A.  It  is  not  hard  to  see  that  the  solution  is  jcJ  = x’,  satisfying  I 

the  optimality  conditions  (5),  and  consequently  that  B*  ^ — C’lx"). 

in 

LE'MMA  3:  Let  Z*(/)  = [U^(i)-U''V  . Then  £Z*(/)  < 

PROOF:  Since  l-UHi)  < for  all  / > 0,Z*(r)  ^ {UHO-f-lUH/)]]'  and  ZHi)^  ^ 

{UHi)-k:[UHi)]]\  We  have  [BZ'(/)F  ^ B[Z‘(/)^]  < k:\UHi)- f-:\UHi)]y  = For  f/HO 
= A A /B(r)  ^ AT/,  by  Theorem  I.  Q.li.D. 

TIIFOREM  3:  Let  Bf  be  the  average  cost  when  there  are  A replications  and  we  are  using 

the  Transportation  Time  Look  Ahead  policy.  Then  H.  < — [('(T")  -I-  A '''  f/'^’l 

in  • 

t 

1 

PROOF:  We  assume  that  the  initial  stale  is  jc,^(0)  = x',  I < / < w,  1 < < A , and  that 

k{N-A)  items  are  held  in  reserve  at  the  repair  center.  With  this  inUial  state  the  Transportation 
Time  Look  Ahead  policy  satisfies  .ir„(r)  =■  x'  whenever  T^(/)  > T*.  This  implies  that  when 

T‘(/)  ^ T*  we_incur  a cost  rate  of  kC(X  ) and  A*'(/)—A^  items  are  held  at  the  repair  center.  . 

When  then  the  cost  rate  is  less  than  or  equal  to  kC(x')  -t-  A''-A''{r),  since  t',(x,)  - ■ 

c, (x,  -E  1)  ^ I for  all  / and  x,  . Thus  the  cost  rate,  given  AHi)  , is  less  than  or  equal  to  ! • 

A(  (x  ) -E  By  Lemma  3,  L'‘(/)  ^ kC(x')  -E  for  all  t.  Therefore 

BJ  > — (<-(x')  + A . 

I- 

j 

COROLl.ARY:  Let  8 be  an  arbitrary  policy  and  B^,  the  average  cost  using  8.  Then  ' if. 

lim  Bi-Bi  < 0.  f 

A oo  j 

I 

PROOF:  Combine  Theorem  2 and  Theorem  3. 

The  prool  ol  Theorem  3 does  not  use  the  fact  that  when  A(/)<A  we  send  an  item  to  the 
consumption  center  which  maximizes  [(^(x.j-c  (x,  -E  1)|.  it  only  requires  that  we  send  it  to  a 
center  which  has  x,(r)  < x,' . 

The  result  does  require  that  the  proper  stock  levels  be  used.  Suppose  that  we  use  the 
Transportation  Time  Look  Ahead  policy,  but  set  x,,  - .v,,  I < / < A , where  ^v,  = A , and 

('(.V)>('(X*)  The  average  cost  of  this  policy,  B*  , is  greater  than  or  equal  to  — C(T),  so 

in 

that 
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lim 

A — 

Theorem  3 also  shows  that  the  MtTRlC  dislribuiion  model  is  optimal  and  we  outline  the 
argument. 

The  METRIC  distribution  model  lakes  the  kN  inventory  items  and  allocates  them  to  the 
repair  center  and  the  km  consumption  centers.  Eor  r/ equals  0,1,  ...  , kN,  METRIC  allocates  >1 
items  to  the  repair  center  and  kN—d  items  to  the  consumption  centers.  The  allocutions  to  con- 
sumption center  / are  based  on  a Poisson  demand  oC  (the  paragraph  below  liq.  (5j  o/  .Sher- 
brooke [4]) 

(6)  K(T,  + M(n) 

where  Mil)  is  the  average  number  of  days  an  item  is  backlogged  at  the  repair  center,  where  </ 
items  are  slocked  at  the  depot,  and  equals  (Hqs.  (4-5)  of  Sherbrookel 

X (i-il)/>ii\kU)/k  A 


Once  the  allocation  which  minimizes  the  sum  of  backorders  at  the  consumption  center  is  deter- 
mined, METRIC  operates  in  time  by  shipping  to  the  consumption  centers  in  the  order  demands 
occur. 

As  k goes  to  infinity  and  d is  .set  equal  to  kU,  A (d)  ^oes  to  /.era.  The  remaining  k.i 
items  are  allocated  to  consumption  centers  in  the  same  way  as  the  Transportation  Time  Look 
Ahead  policy  (Eq.  3 and  4)  since  the  marginal  analysis  is  based  on  (6)  which  is  the  Poisson 
demand  A,7',.  METRIC  satisfies  the  requirement  that  items  are  sent  to  centers  such  that 
x,(l)>x‘.  Therefore  Theorem  3 also  shows  that  METRIC.'  is  optimal. 


ACKNOWLEDGEMENTS 

We  thank  Steve  Lippman  for  making  a number  of  suggestions  on  the  presentation. 


REFERENCES 

(ll  Miller,  B.,  "A  Real  Time  Metric  for  the  Distribution  of  Serviceable  Assets,"  RM-5687-PR, 
The  RAND  Corporation,  Santa  Monica,  California,  (1968). 

[2]  Miller,  B.,  "Dispatching  from  Depot  Repair  in  a Recoverable  Item  Inventory  System:  On 
the  Optimality  of  a Heuristic  Rule,"  Management  Science,  21  316-325,  (1974). 

[31  Ross,  S.,  Applied  Probability  Models  with  Optimization  Applications  (Holden-Day,  San  F'ran- 
cisco,  1970). 

(4)  Sherbrooke,  C.,  "METRIC:  A Multi-Echelon  Technique  for  Recoverable  Item  Control," 
Operations  Research  16,  122-141  (1968). 

(51  Simon,  R.,  "Stationary  Properties  of  a Two-Echelon  Inventory  Model  for  Low  Demand 
Items,"  Operations  Research  19  761-773  (1971). 


i 


I 

1 


i 


I 


I 


PREDICTING  THE  COST  PERFORMANCE  OF 
INVENTORY  CONTROL  SYSTEMS  BY 
RETROSPECTIVE  SIMULATION 


i 


Alastair  MacCormick* 

Deparlmeni  of  Management  Studies 
University  of  Auekland,  New  Zealand 


ABSTRACT 

When  an  inventory-conlrol  system  is  designed  to  utilize  statistical  demand 
information  in  computing  an  (s,S)  policy,  it  may  be  necessary  to  use  perfor- 
mance forecasts  to  calibrate  the  relationship  between  control  parameters  and 
performance  measures.  Using  a simulation  model  to  test  predinion  by  retros- 
pective simulation  we  estimated  the  bias  and  variance  of  forecasts  for  expected 
costs  and  other  operating  characteristics.  The  results  show  the  sensitivity  of  the 
prediction  bias  to  changes  in  demand,  cost,  and  lead  lime  parameters  and  to 
choices  made  in  system  design 


When  a new  inventory  control  system  is  installed,  the  design  process  typically  involves 
forecasting  the  system’s  future  performance.  We  investigate  the  statistical  accuracy  of  forecasts 
made  by  employing  retrospective  simulation.  We  examine  the  bias  and  dispersion  in  these 
forecasts  and  how  they  vary  with  system  settings,  such  as  the  length  of  the  demand  history 
used  for  policy  revision  and  forecasting,  and  with  environmental  specifications,  such  as  the 
underlying  demand  process,  costs,  and  replenishment  lead  time.  Extensive  tests  of  the  retro- 
spective forecasting  technique  are  made  with  a computer  simulation  model. 

I.  PERFORMANCE  FORECASTS  IN  INVENTORY  SYSTEM  DESIGN 

We  assume  that  the  analyst,  in  designing  an  inventory-control  system,  has  selected  a class 
of  decision  rules.  .Specifically,  we  consider  the  (s,  S')  class  of  reorder  point,  reorder  level  rules. 
The  analyst  also  must  choose  a routine  to  calculate  the  values  of  s and  .S',  and  these  computa- 
tions depend  on  the  specification  of  the  demand  process.  If  the  available  empirical  information 
about  demand  is  only  a historical  sample  with  a limited  number  of  previous  observations,  usu- 
ally the  analyst  must  settle  for  a computational  approach  that  provides  an  approximately  optimal 
(minimum  expected  total  cost)  statistical  policy,  since  optimal  statistical  policies  are  available 
for  only  a very  restricted  class  of  situations  (e.g.,  (9)).  We  investigate  approximately  optimal 
statistical  (.v,  .S')  policies  that  employ  the  normal  distribution,  as  one  simplifying  assumption, 
and  estimates  ol  the  demand  mean  and  variance  computed  from  recent  demand  history. 

In  inventory  systems  an  important  trade  off  is  between  inventory  investment  and  service, 
that  is,  out-of-stock  performance.  In  the  classic  (.v,  .S)  model  ([I],  [2]),  this  trade  off  is 
influenced  by  the  relative  values  of  the  unit  holding  and  penalty  cost  parameters,  the  latter 

•C  urrently  Visiting  Professor  at  Graduate  School  of  Busine.ss  Administration,  University  of  North  Carolina  at  Chapel 
Hill,  Chapel  Hill,  North  Carolina 
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often  being  specified  judgmentally.  In  fact,  the  assessment  of  the  appropriate  penalty  cost  value 
may  be  done,  partly,  in  view  of  its  impact  on  inventory  investment.  For  this  and  related  rea- 
sons, the  analyst,  prior  to  implementing  an  inventory-control  system,  wishes  to  predict  the 
system’s  performance  for  one  or  more  specifications  of  the  parameter  settings  employed  in  the 
computational  process. 

For  an  (.v,  S)  type  statistical  system,  the  various  approximations  that  are  employed  in  the 
computations  make  it  highly  questionable  to  use  probability  forii'.ulas  to  provide  accurate  values 
for  the  desired  system  performance  forecasts  of  inventory  investment,  replenishment  fre- 
quency, service,  and  corresponding  costs.  Therefore,  the  analyst  usually  resorts  to  retrospective 
simulation.  That  is,  the  analyst  reuses  the  limited  amount  of  past  data,  from  which  the  demand 
mean  and  variance  are  estimated  and  employed  in  the  is,  S)  computations,  to  predict  (simu- 
late) how  the  chosen  rules  will  perform  in  the  future.  On  the  basis  of  the  resultant  statistical 
forecasts,  the  analyst  may  adjust  the  penalty  cost  or  other  parameters.  The  trade  off  proce.ss  via 
retrospective  simulation  typically  is  carried  out  with  aggregates  for  one  or  more  groups  of 
inventoried  items. 

I 

The  common  practice  of  employing  retrospective  simulation  to  provide  forecasts  of  future  | 

system’s  performance  raises  the  following  questions. 

1.  How  good  are  the  statistical  predictions  of  the  system’s  future  operating  characteristics? 

Are  the  forecasts  biased?  What  is  their  variability?  | 

i 

2.  How  does  the  accuracy  of  retrospective  simulation  forecasts  depend  on  the  amount  of 

historical  demand  information  available?  On  the  system’s  parameter  specifications?  i 

On  the  demand  environment? 

For  a simplified  one-period  inventory  model,  MacCormick  (6)  showed  that  forecasts  by  ' 

retrospective  simulation  do  have  prediction  biases  (for  example  costs  are  underestimated),  and 
analyzed  the  sensitivity  of  the  bias  to  underlying  model  specifications,  such  as  mean  demand, 
cost  parameters,  and  length  of  demand  history.  i 

Here  we  treat  a multiperiod  environment  and  the  complexity  of  the  system  necessitates  ^ 

using  computer  simulation  (MacCormick  (7],  Estey  and  Kaufman  [4])  to  investigate  the  ques-  [ 

lions  posed  above.  We  describe  the  inventory  model  in  the  next  section  and  the  design  for  the  j 

simulation  experiment  in  Section  3.  Subsequently,  in  Section  4,  by  extensive  simulation  test-  [ 

ing,  we  analyze  the  properties  of  the  forecasts.  * 

/ 

2.  A SINGLE-ITEM  MODEL  ? 

« t 

Before  recognizing  the  actual  statistical  environment  that  the  system’s  designer  faces,  we 
describe  an  "ideal"  single-item  model  that  motivates  the  approach.  Consider  the  (.s,  .S')  model  I 

given  in  Veinolt  and  Wagner  [10):  every  period  the  stock  level  (on  hand  plus  on  order)  is 
reviewed.  Demand  in  each  period  is  assumed  to  be  independently,  identically  distributed,  with 
mean  fi  and  variance  Stockouts  are  backlogged  and  eventually  filled,  and  replenishments 
arrive  a known  fixed  lead  time  A periods  after  ordering.  There  is  a fixed  ordering  cost  Co,  for 
each  replenishment  action.  (The  purchase  cost  is  directly  proportional  to  the  quantity  ordered. 

Since  the  optimality  criterion  is  in  terms  of  expected  cost  per  period  and  all  demand  is  eventu- 
ally satisfied,  this  cost  component  can  be  omitted  from  further  consideration.)  The  charges 
applied  to  period-end  inventory  and  backlog  are  linear,  with  unit  holding  cost  Cm  and  unit 
backlog  penally  C„u,,  respectively.  In  any  period  the  lime  sequence  of  events  is  review  and 
order,  delivery,  and  demand.  This  stationary  model  assumes  an  unbounded  horizon. 
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Under  the  above  ideal  assumptions,  an  optimal  rule  is  of  the  (s,  S)  form;  if,  on  review, 
inventory-on-hand-and-on-order  x is  below  s,  an  order  is  placed  for  a replenishment  quantity  of 
S—x  units. 

Wagner  |12l  gives  an  algorithm  to  compute  approximately  optimal  (s,  5);  it  is  adapted 
from  Robert's  |8l  algorithm  by  assuming  that  the  upper  tail  of  the  lead  time  demand  distribu- 
tion is  approximated  by  the  normal  distribution  and  incorporating  a heuristic  modification  pro- 
posed by  Wagner,  O’Hagen  and  Luno'L  ll3]  for  situations  of  frequent  replenishment.  Let  P(  ) 
be  the  standardized  cumulative  normal  distribution,  and  /(-)  be  the  standardized  normal  loss 
function: 

U 

Piu)  “ (27r)"'^^  J*  e~'^^^dx, 

— oo 

oo 

/(m)  “ J(x  — u)e~’‘^^^dx. 

U 

Further  let  the  Wilson  lot  size  be  denoted  as: 
a = (2/x 

and  let  — (X-(-l)fi  and  o-^  — (X-t-Do-^  denote  the  mean  and  variance  of  demand  over  the 

lead  time  plus  one  period  (the  interval  between  reviews).  Values  for  u and  v are  set  by  the  ( 

solutions  to  i 

/(«)  - 

P(v)  - -E  Q„,).  : 

If  Q«lp  is  greater  than  1.5,  the  (s,  5)  pair  is  determined  by  s - -t-  and  S - s -I-  Q„\  : 

otherwise,  ! 

s “ + minimum  {uv^.w^),  ! 

S'  “ Ma  + minimum  (mo-^  + Q„  vo■,^). 

With  discrete  demands,  the  values  for  sand  5 are  rounded  to  the  nearest  integer.  ^ 

The  above  approximately  optimal  values  for  s and  S utilize  only  /x  and  in  specifying  the 
demand  distribution.  The  system’s  designer,  therefore,  can  adapt  the  formulas  to  a statistical  i 

environment  by  employing  estimates  of  n and  tr^  based  on  the  available  demand  history.  t 

I' 

Now  we  elaborate  on  the  design  approach  in  the  statistical  environment.  Suppose  that  the  P 

analyst  has  available  n periods  of  demand  history,  and  believes  that  the  next  n periods  of  r 

demand  will  be  drawn  from  the  same  underlying  demand  process  (at  least  approximately).  i 

(Actually,  n may  be  a design  choice  variable,  and  would  reflect  the  analyst’s  view  of  the  limited  | 

relevance  of  historical  data.)  The  analyst,  then,  statistically  estimates  /x  and  from  the  past  n 

periods  and  calculates  the  (s,  5)  rule  for  the  next  n periods.  To  forecast  performance  over  the  ‘ 

next  n periods  by  retrospective  simulation,  the  analyst  initializes  the  system  using  the  current 
value  of  inventory-on-hand-and-on-order,  and  operates  the  system  with  the  calculated  (s,  S) 
rule  and  the  previous  n demands. 

The  system,  when  implemented,  would  operate  in  this  fashion  over  an  extended  length  of 
time,  that  is,  every  n periods,  the  latest  n demands  would  be  utilized  to  calculate  (s,S)for  the 
next  n periods. 
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3.  EXPERIMENTAL  DESIGN 

The  forecasting  method  just  described  has  been  tested  for  a large  number  of  different 
specifications  of  the  inventory  model. 

Table  I exhibits  the  full  factorial  design  adopted  to  specify  simulation  inputs.  There  are 
648  combinations,  all  of  which  were  tested.  If  unit  time  is  one  week,  the  demand  history 
length  values  of  13,  26,  or  52  correpond  to  quarterly,  semiannual,  or  annual  intervals. 


TABLE  1.  Inpu;  specifications:  a full  factorial  design 


Factor 

Levels 

Number  of 
Levels 

Demand  distribution 

Mean  — p. 

Variance  — cr^ 

Negative  binomial  (trVM“9) 
Negative  binomial  ((rV^i“3) 
Poisson 

3 

Mean  demand  (m) 

2,4,8,16 

4 

Unit  holding  cost  (C|„) 

1 

1 

Unit  backlog  penalty  cost  (C„„,) 

4,9,99 

3 

Replenishment  setup  cost  (Cr,) 

32,64 

2 

Replenishment  lead  time  (X) 

0,2,4 

3 

Demand  history  length  (n) 

13,26,52 

3 

For  each  of  the  648  cases,  we  replicated  the  retrospective  forecasts  200  times  to  derive  the 
comparisons  between  actual  and  predicted  values  of  several  operating  characteristics,  including 
expected  costs  and  backlog  frequency.  In  each  replication,  the  sample  values  for  the  actual  and 
predicted  performance  characteristics  are  averages  over  the  n periods  of  that  replication.  In 
estimating  actual  expected  costs  and  service  performance,  we  made  use  of  the  conditional 
expectation  method  of  variance  reduction  (Ehrhardt  13)).  To  provide  a set  of  initial  conditions 
for  each  replication,  we  employed  ergodic  statistical  theory  in  the  experiment  as  follows.  For 
each  case,  we  let  the  200  replications  comprise  a single  historical  time  sequence.  That  is,  the 
(s,S)  policy  values  were  computed  at  periods  n+1.2n  + l,3«  + l,  ...  , and  the  retrospective 
forecasts  were  based  on  the  initial  inventory  position  at  each  of  those  periods.  Thus,  the  entire 
set  of  replications  for  a case  represents  200rt  consecutive  time  periods.  We  treated  the 
sequence  of  200  measurements  on  each  operating  characteristic  as  a covariance  stationary 
series,  and  derived  estimates  for  the  sampling  variance  by  using  an  autoregression  model 
(Appendix  B of  Fishman  (51). 

4,  FORECASTS  BY  RETROSPECTIVE  SIMULATION 

Before  presenting  summary  results  for  the  648  cases  specified  in  Table  1,  we  examine 
detailed  results  for  a single  case  to  get  initial  insight  to  the  questions  posed  in  section  1.  The 
case  is 
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Negative  binomial  demand  {(r^lfi~9) 
Mean  demand  m “ 16 
Unit  backlog  cost  C„u,  - 99 
Replenishment  set  up  cost  C,,,  - 64 
Replenishment  lead  time  X - 4. 


Observe  that  each  parameter  is  at  its  largest  value  in  Table  I.  For  this  single  special  case, 
we  employed  1200  replications,  instead  of  only  200  Figure  I shows  five  different  operating 
characteristics  On  each  plot  the  horizontal  axis  represents  the  demand  history  length  n.  Nine 
different  lengths  are  tested,  ranging  from  6 to  52.  An  asterisk  (•)  designates  the  estimate  of 
the  actual  /(-period  expected  value  of  the  operating  characteristic;  the  corresponding  cross  (x) 
shows  the  estimated  average  forecast  value  from  retrospective  simulation.  Flach  estimate  is  an 
average  from  1200  w-period  replications.  (Table  A1  of  the  Appendix  provides  these  estimates 
of  the  expected  value,  along  with  the  standard  deviation  of  the  /(-period  average  operating 
characteristic.)  The  prediction  bias  is  represented  by  vertical  differences  between  the  pairs  of 
curves  in  each  plot  of  Figure  1. 

The  horizontal  line  on  each  plot  shows  the  level  of  the  expected  value  of  the  operating 
characteristic  when  control  is  by  the  normal  approximation  policy  (Wagner  (121),  with  the 
underlying  demand  mean  16  and  variance  9x16  used  to  calculate  the  (s,S)  parameters.  Asymp- 
totically, the  actual  and  forecast  characteristics  for  the  statistical  version  tend  to  this  value  as 
the  demand  history  length  increases,  assuming  that  demands  from  consecutive  pairs  of  n 
periods  are  drawn  from  the  same  underlying  distribution. 

For  each  of  the  operating  characteristics,  Figure  1 shows  that  the  actual  expected  value  is 
systematically  underestimated  by  a retrospective  simulation  forecast.  The  bias  is  negligible  for 
inventory  quantity  and  replenishment  frequency,  even  for  n as  small  as  6 periods.  But  the  bias 
is  severe  for  the  backlog  quantity  and  frequency,  even  for  n as  large  as  52  periods.  Most  of  the 
prediction  bias  for  expected  total  cost  per  period  can  be  traced  to  the  bias  in  the  forecast  of 
backlog  quantity.  For  each  characteristic,  the  bias  becomes  smaller  with  increases  in  the 
demand  history  length.  With  demand  history  length  set  at  13  periods,  the  value  of  expected 
total  cost  is  179%  of  the  expected  value  of  the  forecast.  Likewise,  the  value  of  expected 
period-end  inventory  is  101%,  backlog  frequency  is  1174%,  expected  backlog  is  2848%,  and 
replenishment  frequency  is  101%,  of  the  respective  expected  forecast  value.  With  demand  his- 
tory length  at  52  periods,  the  corresponding  ratios  are;  expected  total  cost  122%,  period-end 
inventory  100%,  backlog  frequency  197%,  expected  backlog  273%,  and  replenishment  frequency 
100%. 

Note  that  the  results  in  Figure  1 pertain  to  the  expected  values  of  the  actual  and  fore- 
casted /7-period-average  operating  characteristics.  It  is  also  of  interest  to  determine  how  accu- 
rate a single  forecast  may  be,  by  examining  the  distribution  of  the  forecast  error.  To  discuss 
accuracy,  we  distinguish  two  values  as  possible  objects  of  the  forecast.  When  making  a forecast 
for  the  short-run  performance  of  a policy  being  evaluated  for  one  item,  the  object  is  to  forecast 
the  exact  //-period-average  value  which  each  operating  characteristic  will  take  when  the  policy  is 
implemented  over  the  next  n periods.  In  the  second  case,  we  envisage  an  ensemble  of  an  arbi- 
trarily large  number  of  items  with  identical  underlying  specifications,  no  interdependence  of 
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demand  and  initial  inventory  positions  randomly  sampled  from  an  equilibrium  probability  distri- 
bution. The  object  of  the  forecast  is  to  predict  from  the  n-period  history  of  an  arbritrarily 
chosen  item  the  ensemble  average  or  expected  operating  characteristics.  By  ergodic  theory, 
these  expected  values  can  be  estimated  as  the  long  run  operating  characteristics  obtained  for 
one  item  by  time  averaging  when  the  policy  is  revised  repeatedly  every  n periods  in  a stationary 
environment.  This  second  case  will  yield  some  insight  to  performance  prediction  for  inventory 
systems  with  many  items. 

To  measure  the  accuracy  of  a single  forecast,  for  each  operating  characteristic  we  con- 
structed from  the  1200  replications  a sample  distribution  of  the  forecast  error,  expressed  as  a 
percentage  relative  to  the  forecast.  Tables  2 and  3 show  these  distributions  at  selected  values  of 
the  error  percentage.  Consider  first  the  error  distributions  for  n-period  average  total  cost. 
When  the  object  is  a short-run  forecast  for  one  item  and  the  history  length  is  13  periods.  Table 
2 shows  an  estimated  64.4%  chance  that  the  actual  13-period-average  total  cost  will  be  outside 
the  range  10%  below  to  10%  above  the  forecast  value  with  odds  of  51.2  to  13.2  (approximately 
4 to  1),  that  the  forecast  will  be  on  the  low  side  of  that  actual  value.  When  the  history  length 
is  52  periods,  the  chance  that  the  forecast  of  the  52-period-average  total  cost  will  be  in  error  by 
10%  or  more  of  itself  is  62.5%,  much  the  same  as  for  n=  13,  but  the  odds  for  an  underestimate 


TABLE  2.  Percentage  error  distribution  for  forecasts 
of  short-run  operating  characteristics 

Table  2 shows  empirical  cumulative  distributions  of  the  percentage  difference,  measured  rela- 
tive to  the  forecast,  between  the  forecast  and  actual  n-period-average  values  realized  for  each 
operating  characteristic.  The  distributions  are  each  obtained  from  1200  replications  for  the  sin- 
gle item  case  having  specifications:  negative  binomial  demand  {p.  = 16,  rrfp  = 9);  C„„,  = 99. 
fix  = 64,  C|n  = 1 and  X = 4. 


Operating 

Characteristic 

Demand 

History 

Length 

(n) 

Percentage  of  Eorecast  by  which  Actual 
Value  is  Underestimated  is  at  Least: 

Forecast 

Exactly 

On 

Target 

■ H 

Percentage  of  Forecast  by  which  Actual 
Value  is  Overestimated  is  at  Least: 

100%j 

50')(i 

20% 

10% 

0 

0 

10% 

20%. 

50% 

100% 

■ ■ ■ ■ 

Average 

13 

14.8 

19.2 

34.9 

51.2 

70.3 

29.7 

13.2 

05.1 

0.8 

I'olal  C'osI 

52 

12.3 

19.4 

29.2 

41.7 

63.7 

36.3 

20.8 

12.6 

1.5 

Average 

Period-End 

13 

3.6 

25.3 

39.3 

51.8 

48  2 

36.5 

26  3 

4.9 

Inventory 

52 

07.9 

24.1 

51.6 

48.4 

25.7 

09.3 

0.3 

Average 

Backlog 

13 

22.3' 

22.4 

22.5 

22.5 

22.5 

75.2 

02,3 

02.3 

02.3 

02.2 

02.2^ 

Quantity 

52 

36.7' 

37.6 

38.0 

38.2 

38.6 

32.7 

28.7 

28.7 

28.4 

27.1 

24.3’ 

Backlog 

13 

21  9' 

21.9 

21.9 

21.9 

21.9 

75.9 

02.2 

02.2 

02.2 

02.1 

2.1’ 

Frequency 

52 

35.7' 

36  2 

36.3 

36.3 

36.3 

38.0 

25.7 

25.7 

25.7 

25.6 

23.5’ 

Replenishment 

13 

4.5 

12.4 

31.7 

31.7 

31.7 

36.3 

32.0 

32.0 

32.0 

03.4 

Frequency 

52 

0.04 

10  1 

21  1 

39.7 

19  8 

40.5 

23.2 

04.9 

(1)  (.See  ruble  4).  A zero  forecast  was  followed  by  a positive  n-period  average  backlog  in  (21.1%)  of  (be  1200 

replications  for  n “ 13,  and  in  332  (21.1%)  of  the  replications  for  n — 52. 

(2)  (See  Table  4).  A backlog  was  predicted  but  did  not  eventuate  in  25  (2.1%)  of  tbe  1200  replications  for  n — 13,  and 

in  282  (23  5%)  of  the  replications  for  n - 52. 
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TABLE  3.  Percentage  error  distribution  for  forecasts 

of  expected  operating  characteristics  ^ 

Table  3 shows  empirical  cumulative  distributions  of  the  percentage  difference,  meas- 
ured relative  to  the  forecast,  between  the  forecast  and  the  expected  value  of  each 
operating  characteristic.  The  distributions  are  each  obtained  from  1200  replications 
for  the  single  item  case  having  negative  binomial  demand  (p.  = 16,  ir^/p  = 9); 

‘'’ou.  ” 99.  C'fi,  - 64,  C,„  - I and  A - 4, 


Operating 

Demand 

History 

I'orecast  tindcrestimates  by 
at  Least 

Forecast  Overestimates  by 
at  Least 

(n) 

KXrKi 

50'»l. 

20'k, 

lO-M. 

0 

0 

IW 

2(VM. 

50% 

loo'll. 

13 

396 

81  4 

96  8 

98  2 

99.0 

01  0 

006 

IK)6 

0 1 

Total  Cost 

52 

00.2 

23.7 

72.7 

82  4 

88  2 

118 

08  0 

04.7 

0 3 

Period-End 

13 

(K)  8 

07  9 

30.3 

42  8 

57  2 

42  7 

27.7 

15  1 

0 2 

Inventory 

52 

09  3 

27.7 

55  3 

44.7 

19  5 

03  8 

Backlog 

13 

97.6* 

98.2 

98.4 

98.7 

98.7 

01  3 

(K)8 

00  7 

0.6 

Quantity 

52 

79.0* 

82  8 

86.5 

87.3 

88  9 

III 

09.9 

08  4 

3 6 

Backlog 

...  . 

13 

96.3* 

96.3 

96.3 

96.3 

96  3 

03.7 

03  7 

03.7 

3.7 

Frequency 

52 

00 

59.8 

59.8 

59  8 

87  0 

13  0 

13.0 

‘ 130 

3 2 

Replenishment 

13 

01.8 

12.0 

12.8 

49.6 

49  6 

50.4 

50.4 

08  8 

Frequency 

52 

02.8 

18.7 

50.4 

49.6 

03  5 

*(See  Table  4)  Zero  backlog  was  prcdicled  in  115S  (96.3%)  of  the  1200  replications  for  n — 13.  and  in 
718  (59.8%)  of  the  replications  for  n — 52. 


are  only  about  2 to  1.  Table  3 demonstrates  in  similar  fashion  the  errors  when  the  object  is  to 
forecast  the  ensemble  average  total  cost.  With  the  13-period  demand  history  the  chance  the 
forecast  is  in  error  by  10%  of  itself,  or  more,  is  estimated  to  be  98.8%,  an  underestimate  being 
almost  certain.  With  as  many  as  52  demands  in  the  history,  the  chance  is  still  high  at  90.4%, 
and  the  odds  for  underestimation  are  about  10  to  1. 

Turning  to  operating  characteristics  which  form  the  components  of  total  cost,  there  is  sub- 
stantial dispersion  in  the  forecasts  for  the  average  period-end  inventory  and  the  replenishment 
frequency,  but  it  is  the  prediction  error  in  the  backlog  quantity,  weighted  by  the  unit  backlog 
penalty  of  99,  which  is  the  main  source  for  the  bias  and  dispersion  observed  in  the  forecasts  of 
average  total  cost.  Even  with  52  periods  of  history,  the  chances  of  a short-run  forecast  (Table 
2)  being  10%  of  itself  or  more  in  error  are  49.8%  for  inventory,  66.9%  for  backlog  and  44.3% 
for  the  replenishment  frequency,  and  when  forecasting  expected  values  (Table  3),  the 
equivalent  chances  are  47.2%,  97.2%  and  22.2%.  ^ ./ 

4 

We  found  wide  dispersion  in  percentage  forecast  error  to  be  very  common  in  the  648  • ^ 

cases  of  Table  1.  Indeed  the  dispersion  is  so  great  that  little  reliance  can  be  given  to  a single 
forecast  made  from  the  demand  history  for  one  item.  This  conclusion  applies  with  particular  ^ 

force  to  forecasts  of  backlog  performance,  which  we  next  examine  in  greater  detail,  before  | 

considering  whether  an  aggregation  of  single-item  predictions  can  be  useful  for  predicting  sys-  i 

terns  performance.  j 
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A reliable  calibration  of  the  important  trade  off  between  inventory  investment  and  backlog 
performance  can  be  made  only  if  there  is  an  accurate  method  to  forecast  backlogs.  But  inven- 
tory control  systems  are  designed  to  make  backlogs  relatively  rare  events:  indeed  for  the  special 
case  we  have  been  using  for  illustration,  when  the  demand  history  length  is  13  periods,  on  the 
average  a backlog  actually  occurs  about  once  in  24  periods;  with  a history  length  of  52  a backlog 
happens  about  once  in  43  periods.  Using  demand  samples  with  history  lengths  so  short  relative 
to  the  average  times  between  backlogs,  it  is  hardly  surprising  that  it  is  very  difficult  to  predict 
correctly  the  backlog  frequency,  let  alone  the  magnitude  of  any  backlogs.  Table  4 illustrates 
this  problem  for  backlog  frequency.  As  an  example  from  Table  4(a),  which  shows  results  for 
1200  replications  with  the  demand  history  length  set  at  13  periods,  consider  the  126  in  which  a 
backlog  actually  occurred  once  in  13  periods.  In  only  9 of  these  126  replications  was  the  one 
backlog  predicted  correctly.  Of  the  remaining  forecasts,  116  predicted  no  backlog  and  one 
predicted  two  backlogs.  The  nonzero  off-diagonal  entries  in  Table  4 represent  replications  in 
which  the  forecast  did  not  correctly  predict  the  actual  frequency,  and  show  that  when  backlogs 
occurred  there  was  little  chance  of  an  accurate  prediction.  Indeed,  for  the  case  n=52,  a chi- 
square  test  on  the  sample  distribution  (aggregated  to  5 rows  and  4 columns)  fails  to  reject  the 
hypothesis  that  forecast  and  actual  frequencies  are  statistically  independent,  at  a level  of 
significance  of  90'/n. 


TABLE  4.  Sample  frequency  distributions  for  actual  and  forecast  backlog  frequency 

I — number  of  periods  with  backlog  during  n-period  retrospective  simulation. 

J ~ number  of  periods  with  backlog  during  subsequent  n periods. 

(a)  HISTORY  LENGTH  n-13  (b)  HISTORY  LENGTH  n-52 


1-0 

1 

2 

3 

— 

Total 

J-0 

902 

19 

4 

2 

927 

1 

116 

9 

1 

126 

2 

48 

2 

50 

3 

40 

2 

42 

4 

19 

1 

20 

5 

16 

1 

1 

18 

6 

7 

1 

8 

7 

1 

1 

8 

5 

2 

7 

9 

10 

11 

1 

1 

Total 

1155 

37 

6 

2 

1200 

f 
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We  consider  now  whether  single  item  forecasts  can  be  made  more  useful  for  systems 
predictions  by  aggregation  over  items.  Typically,  a representative  sample  of  items  from  the  sys- 
tem is  employed  to  make  aggregate  forecasts.  As  more  items  are  included  in  this  sample,  pro- 
vided there  is  no  interdependence  between  items  and  all  have  the  same  sign  for  forecast  bias, 
the  bias  of  the  aggregate  forecast  increases  at  a rate  faster  than  the  standard  deviation  of  the 
aggregate.  Given  a sufficiently  large  number  of  items,  correction  for  bias  will  reduce  appreci- 
ably the  mean  square  error  of  the  system  forecast.  Since  the  bias  may  be  sensitive  to  the 
parameters  (cost,  demand,  lead  time)  for  each  item  in  the  sample,  and  will  impact  the  aggregate 
accordingly,  we  next  explore  the  sensitivity  of  the  bias  in  forecasts  aggregated  for  different 
groups  of  items. 

We  examine  9 groups  of  72  items  formed  by  partitioning  the  648  cases  in  Table  1,  so  that 
each  item  in  a group  has  the  same  demand  distribution,  variance-to-mean  ratio,  and  histosy 
length.  In  Table  5,  we  show  the  extent  to  which  prediction  bias  is  sensitive  to  the  demand  his- 
tory length  and  the  demand  distribution.  To  form  the  72  items  aggregates,  we  have  used  cost 
components,  which  facilitate  sensitivity  comparisons.  As  we  saw  in  the  single-item  case  above, 
the  bias  decreases  as  the  history  length  increases.  By  comparing  rows  in  the  table,  we  see  that 

TABLE  5.  Bias  in  Forecasts  of  Aggregate  Operating 
Characteristics  by  Retrospective  Simulation 


Each  actual  and  forecast  value  is  a sample  average  from  200  replications,  obtained  by  partition- 
ing the  648  items  specified  in  Table  1 into  9 groups,  each  with  a given  demand  distribution  and 
demand  history  length  and  by  aggregating  within  each  group. 


Demand  History  Length  n 

n - IJ 

n 26 

n - 52 

Actual 

Forecast 

Bias 

Actual 

Forecast 

Bias 

Actual 

Forecast 

Bias 

Value 

Value 

(%) 

Value 

Value 

(%) 

Value 

Value 

(%) 

Negative  Binomial  Demands 
,r^/M  - 9 

Molding  C'osi 

1874 

1822 

(02  8) 

1817 

1788 

(01  6) 

1804 

1790 

KM)  8) 

backl()g  C'osi 

1562 

280 

(82.0) 

1279 

458 

(64  2) 

1080 

598 

(44  6) 

Replenishment  Cost 

670 

658 

(01  8) 

682 

674 

(01  2) 

683 

679 

KM)  61 

Total  Cost 

4105 

2760 

(32  8) 

3778 

2920 

(22  7) 

3567 

Mm 

(14  0) 

Backlog  I requeney 

108 

.070 

(34  6) 

KM) 

080 

(20  1) 

093 

083 

(11  5) 

Negative  Binomial  Demands 

- i 

Holding  Cost 

1290 

1271 

(01  5) 

1274 

1262 

((Ml  9) 

1270 

1265 

KM)  5) 

Backlog  C'osi 

564 

186 

(669) 

457 

242 

(47  1) 

401 

289 

(28  01 

Kcpicnishmcni  C‘osi 

744 

737 

(01  I) 

750 

745 

((M)6) 

750 

747 

KM)  3) 

Total  ('ost 

2598 

2194 

(15  6) 

2480 

2249 

(09  3) 

2421 

2.301 

(05  01 

Backlog  I retiueney 

101 

082 

(23.7) 

094 

083 

(13  2) 

092 

086 

(06  91 

I’oisson  Demands 

- 1 

Holding  C ost 

1024 

1019 

((H)  6) 

1020 

1018 

((8)3) 

)())9 

1017 

KM)  2) 

Backlog  Cost 

245 

133 

(45  9) 

207 

149 

(27  9) 

190 

162 

(146) 

Replenishment  Cost 

775 

771 

(00  5) 

777 

776 

((K)  2) 

777 

776 

100  1) 

Total  C ost 

2046 

1922 

(06  0) 

2005 

1943 

(03  1) 

1986 

1956 

(01  5) 

Backlog  t requency 

09  .t 

082 

(13  0) 

090 

084 

(06  51 

088 

085 

(03  0) 
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the  bias  decreases  as  the  group  demand  variance-to-mean  ratio  decreases.  Most  of  the  bias  in 
total  cost  is  attributable  to  the  backlog  component. 

Tables  A2,  A3,  and  A4  of  the  Appendix  further  split  each  group  of  72  items  to  show  the 
sensitivity  of  the  prediction  biases  to  other  system  parameters.  The  bias  in  the  forecast  of 
expected  average  total  cost  increases  with  increases  in  Cou,  /C,„,  replenishment  lead  time  X and 
demand  m.  hut  decreases  with  The  prediction  bias  is  large  for  the  backlog  component 

of  expected  total  cost,  but  negligible  for  the  holding  and  replenishment  components.  More 
information  about  the  properties  of  aggregate  forecasts  is  available  in  MacCormick  (61  and 
Estey  and  Kaufman  (4). 

We  conclude  that  forecasts  made  by  retrospective  simulation  have  prediction  biases,  lead- 
ing to  underestimates  of  operating-characteristic  expected  values.  The  underestimation  is  par- 
ticularly significant  for  backlog  performance  measures.  The  bias  is  due  to  the  double  use  of 
historical  demand  information  in  the  statistical  approximation  policy:  first  to  set  the  system 
(s,S)  parameters,  and  then  to  forecast  the  subsequent  performance.  Further,  there  is  consider- 
able dispersion  in  the  distribution  of  forecast  errors  for  individual  predictions.  Hence,  the 
system’s  designer  should  be  most  cautious  in  making  system’s  predictions,  especially  with 
regard  to  service  performance  and  the  trade  off  between  inventory  investment  and  service.  The 
results  in  this  paper  suggest  that  predictions  bases  on  only  a small  sample  of  items  may  have  a 
large  error.  Thus,  the  analyst  is  advised  to  be  generous  in  selecting  the  number  of  items  to  use 
in  a system’s  design  test.  Similarly,  the  analyst  is  advised  to  use  as  long  a demand  history 
length  as  is  available  and  sufficiently  representative  of  the  future  demand  environment. 
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additional  operating  CHARACTERISTIC  DATA 


Contains  the  detailed  operating  characteristic  data  obtained  for  the  single  item 

‘"“"■•V  -OdUiona,  sensi.IvU, ‘dau'^ 
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TABLE  A1 . Dependence  of  actual  and  forecast  operating  characteristics 
on  size  of  demand  sample  used  at  revision 


Demand 

Sample 

Si/c 

n 

Esiimalcd  Means  and  Siandard  Dcvialions,  Aggregated  over  n Periods,  ol 

Total  Cost 

Period-End  Inventory 

Period-End  Backlog 

Backlog  Frequency 

Replenishment  Frequency 

Actual 

Forecast 

Actual 

Forecast 

— 

Actual 

Forecast 

Actual 

Forecast 

Actual 

Forecast 

6 

206  4 
(456  4) 

93.1 
( 31.7) 

79.7 

(44.5) 

78  2 
(29.5) 

1.127 

(4.760) 

0.002 

(0.040) 

0.0546 

(0.1559) 

0.0006 

(0.0096) 

0.236 

(0.139) 

0 230 
(0.124) 

K 

191  5 
(336  4) 

92  9 
( 29.8) 

77.2 

(40.5) 

75.4 

(25.7) 

0.993 

(3.530) 

0.014 

(0.148) 

0.0503 

(0.1323) 

0.0020 

(00164) 

0.251 

(0.114) 

0.251 
(0  099) 

13 

168  0 
(238  9) 

93.6 
( 26.5) 

74.7 

(32.5) 

73.9 

(19.1) 

0.769 

(2.526) 

0.027 

(0.196) 

0.0411 

(0.1002) 

0.0035 

(0.0194) 

0.268 

(0.073) 

0.266 

(0.067) 

IK 

157  3 
(165  4) 

94.3 
( 23  4) 

74.5 

(28.8) 

73.9 

(17.1) 

0.6S8 

(1.771) 

0.030 

(0.155) 

00359 

(0.0747) 

0.0041 

(0.0158) 

0.275 

(0.056) 

0.273 

(0.049) 

24 

143.7 
(124  1) 

97.6 
( 29  7) 

74.3 

(24.6) 

73.8 

(14.2) 

0.523 

(1.342) 

0.063 

(0.259) 

0.0297 

(0.0628) 

0.0064 

(0.0181) 

0.276 

(0.046) 

0.275 

(0.040) 

30 

138  4 
(110.2) 

98  8 
( 28  0) 

74.0 

(22.5) 

73.7 

(12.8) 

0.470 

(1.198) 

0.074 

(0.244) 

0.0272 

(00543) 

0.0074 

(0.0175) 

0 278 
(0039) 

0 277 
(0  032) 

36 

134  0 
(100  6) 

101  8 
( 32.1) 

74.0 

(20.2) 

73.8 

(114) 

0.426 
(1  094) 

0.1  ()4 
(0290) 

0.0250 
(0  0466) 

0.0091 

(0.0176) 

0 278 
(0035) 

0.277 

(0028) 

42 

131  4 
( 92  2) 

103  6 
( 31  9) 

74  1 
(18.9) 

73.9 

(10.8) 

0.398 

(1.005) 

0.120 

(0.290) 

00235 

(0.0421) 

0.0100 

(00175) 

0.279 

(0032) 

0 278 
(0.026) 

52 

127.8 
( 79  5) 

104  8 
( 29.2) 

74.2 
(17  3) 

73.9 
( 9.6) 

0.361 

(0.874) 

0.132 

(0.266) 

00217 

(00372) 

0.01 10 
(0.0158) 

0.279 

(0029) 

0 278 
(0.022) 

Mode!  Spt^tifuaiions: 

Demand  distribution: 
Cost  parameters; 
Replenishment  lead  lime: 


Negative  binomial  with  n 


99, 


C - 64,  C 

ft*  ir 


16, 

1; 


9; 


i 

r 

f 

1 

I 
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TABLE  A2.  Percentages  by  which  forecast  underestimates  expected  costs  per  period, 
for  various  classifications  of  12  items  under  statistical  control 
(Demand  History  Length:  13  Periods) 


■ 

Input  Parameters 

C ('(iniponcnJ 

( /C 

oui  in 

/(■ 

m 

Lead  lime  A 

Mean  DemamI 

M 

9 

99 

32 

64 

0 

2 

4 

2 

4 

8 

If) 

Negative  Binomial  Demands 
.fV/i  - 9 

Holding  ( ost 

2 8 

5 8 

3 6 

10 

3 2 

2 4 

0 2 

2 3 

4 7 

1 7 

4 1) 

3 0 

3 0 

Backlog  Cost 

82  0 

45.3 

65  7 

94  0 

82.7 

81  4 

58  9 

83  2 

89  8 

87  8 

84  U 

80  3 

78  3 

Kcpienishmeni  Cost 

1 8 

1 5 

1 9 

2 0 

14 

2 1 

2 8 

1 7 

0 9 

4 2 

2 7 

1 3 

0 9 

lolal  Cost 

32  8 

14  9 

196 

46  1 

34  3 

31  4 

17  8 

33  5 

40  3 

40  5 

V \ 

31  6 

27  8 

Negative  Binomial  Demands 
trV/i  “ ,1 

Holding  Cost 

1 3 

3 5 

2 1 

0 2 

1 8 

1 3 

0 2 

1 3 

2 6 

18 

I 7 

1 4 

1 4 

Backlog  Cost 

6h  9 

.30  7 

53  0 

69  5 

69  3 

64  5 

37  7 

65  4 

77  8 

75  1 

73  8 

66  7 

577 

Replenishmeni  Cost 

1 1 

1 0 

0 9 

0 9 

1 1 

13 

10 

0 9 

2.2 

I (1 

1 1 

0 6 

l olal  Cost 

15  (1 

8 0 

10  7 

23  1 

17  8 

13  7 

5.7 

14  9 

22  7 

20  8 

18  9 

15  8 

11  5 

Poisson  Demands 

tr^/fi  * 1 

Holding  Cost 

0 6 

1 6 

0 6 

0 0 

0 4 

0 7 

0 2 

0 2 

1 2 

0 8 

0 8 

0 5 

0 4 

Backlog  ( ost 

45  9 

18  0 

36  1 

81  7 

50  6 

41  4 

16  6 

44  0 

58  2 

53  5 

49  5 

44  5 

42  2 

Replenishment  ('ost 

0 5 

0 3 

0 7 

0 6 

0 7 

0 4 

0 7 

0 4 

0 4 

1 4 

10 

0.5 

0 0 

I ota)  Cost 

6 0 

3 5 

4 4 

. 

9 0 

7 3 

5 0 

1 7 

10  0 

7 5 

7 0 

5 9 

5 1 

i 
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TABLE  A3.  Percentages  by  which  forecast  underestimates  expected  costs  per  period, 
for  various  classifications  of  72  items  under  statistical  control 
(Demand  History  Length:  26  Periods) 


Input  I’tirumclers 

Cost  Componcni 

! 

Fouil 

c-  /C 

out  in 

1 ead  time  K 

Mean  Demand 

4 

9 

' 

99 

.72 

64 

0 

2 

4 

2 

4 

8 

16 

Negative  Hinomial  Demands 

ll()klin)£  ('osi 

If) 

.7  1 

2 1 

0.6 

19 

1 .7 

0 0 

1 5 

26 

1 7 

1 8 

1 5 

1 5 

Kdcklo^  Cost 

f)4  2 

28  0 

44,9 

79  0 

65.7 

62  7 

.77  1 

6 .7  0 

76  .7 

74  4 

68  .7 

59  8 

57.4 

Kcplenishnicnl  Cost 

1 2 

1 (1 

1.4 

1 .7 

0 9 

15 

1 .7 

1 5 

0 9 

4 2 

1 4 

0 8 

0 4 

lolal  Com 

22  7 

8.7 

12  4 

.74  0 

24  8 

20  9 

10.7 

22  2 

.70  .7 

.74  5 

27  0 

20  .7 

16  8 

Ncgiilivc  Hinomial  Demands 
tr^/fx  — .1 

ilotdin>!  C<»st 

(IT) 

2 1 

10 

0 4 

10 

0 9 

0 2 

0 9 

15 

1 .7 

0 9 

0.9 

0 8 

Baeklott  (*osl 

47  1 

17  .7 

.72.5 

71  5 

50  1 

44  1 

19  .7 

45,7 

59  6 

58.7 

46.0 

44  9 

43  2 

Replenishment  Cost 

0 6 

0 6 

0 7 

05 

0.5 

0 6 

0 6 

0 5 

0.7 

19 

0 6 

0 2 

0.7 

Total  ('osl 

9.7 

4 4 

5 8 

14  8 

10  9 

8 1 

2 8 

9 .7 

14  1 

14.9 

9 .7 

8.6 

7.7 

Poisson  Demands 

tr'/fj.  — 1 

Holding  Cost 

0 .7 

09 

0 2 

-0  1 

0 .7 

0.2 

0 1 

0 .7 

0 .7 

0.4 

0 4 

0 .7 

0 1 

Racking  Cost 

27,9 

7 9 

21  4 

62  4 

.712 

25  1 

5.7 

27.,7 

.78  5 

,74  4 

27  6 

27,2 

26  .7 

Replenishment  Cost 

0 2 

0 2 

0 2 

0,1 

0 1 

0 2 

0..7 

0.2 

0 0 

0 7 

0 4 

0 0 

-0  1 

Total  Cost 

.7.1 

16 

2 .7 

4 8 

.7.7 

2 6 

0 6 

.7  1 

5.2 

4 1 

.7  ,7 

.7  0 

2 7 

F 

r 
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TABLE  A4.  Percentages  by  which  forecast  underestimates  expected  costs  per  period, 
for  various  classifications  of  72  item:  under  statistical  control 
(Demand  History  Length  52  Periods) 


Inpul  Raraniclcrs  ! 

Cost  (Ompjjncnl 

Tolal 

c;  /c 

oul  in 

. 

/C 

in 

l.ea«  lime  K 

lean  1) 

cmand 

^ ; 

4 

9 

99 

32 

64 

0 

2 

4 

2 

4 

K 

U)  j 

Negative  Binomial  Denianils 

Holding  Cost 

0 K 

1.6 

1.0 

0 3 

0 9 

0.7 

0.2 

0 7 

1 3 

0 8 

0 8 

0 9 

0 7 

Backlog  Cost 

44  6 

16  2 

27.8 

58.6 

45.4 

43  9 

24  2 

43  (1 

55  8 

58  1 

49  0 

39  2 

35  2 

Replenishment  C ost 

06 

0 8 

0 3 

0 6 

0 5 

0 6 

0 6 

0 5 

0 7 

1 6 

0 7 

0 5 

0 2 

Total  Cost 

14  0 

4 9 

6.9 

22  1 

15  3 

12.9 

6 5 

13  6 

19  1 

24  9 

17  5 

1 1 8 

S K 

Negative  Binomial  Demands 
"•  3 

Holding  C’osl 

0 5 

12 

0 6 

0 0 

0.5 

0 4 

0 1 

0 4 

0 7 

0 4 

0 9 

0 4 

n 3 

Backlog  Cost 

28  0 

8.4 

17  6 

47.8 

29  3 

26.7 

10  5 

26  6 

37  4 

36  2 

29  3 

27  1 

235 

Replenishment  Cost 

0.3 

0 3 

0 3 

0 3 

0 3 

0 3 

0 3 

0 4 

0 3 

1 0 

0 5 

0 2 

0 0 

Total  Cost 

50 

2.2 

3 0 

8 1 

5 7 

4 4 

1 5 

4 8 

7 8 

8 0 

5 8 

4 7 

3 h 

Poisson  Demands 

••  I 

Holding  C’ost 

0 2 

0 5 

0 3 

-0  1 

0 2 

0 1 

0 0 

0 1 

0 5 

0 2 

0 4 

0 1 

0 1 

Backlog  Cost 

14  6 

3.7 

98 

39  0 

16  6 

12  8 

1 7 

14  1 

21  7 

1'14 

14  2 

15  9 

119' 

Replenishment  Cost 

0 1 

0.1 

0.2 

0 0 

0 0 

0 2 

0 1 

0 1 

0 1 

0 5 

0 2 

0 0 

0 0 

Total  Cost 

. 

I 5 

0 8 

1 1 

2 4 

1 8 

1 3 

0 2 

14 

2 8 

2 2 

17 

1 6 
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ABSTRACT 

Consider  an  auciion  in  which  increasing  bids  are  made  in  sequence  on  an 
obiecl  whose  value  « is  known  to  each  bidder  Suppose  ii  bids  are  received, 
and  the  distribution  ol  each  hid  is  conditionally  uniform  More  specifically, 
suppose  the  first  bid  V|  is  uniformly  distributed  on  (D.  Wl.  and  the  /^'  bid  is  un- 
iformly distributed  on  l.V,  p wl  for  > “ 2.  . . . n.  A scenario  in  which  this 
auction  model  is  appropriate  is  described  We  assume  that  the  value  W is  un 
known  to  the  statistician  and  must  be  esimaled  from  the  sample 
\\.  V2 V„  Ihj  best  linear  unbiased  estimate  of  ft  is  derived-  The  in- 

variance of  the  eslimaiion  problem  under  scale  transformations  Is  noted,  and 
the  best  invariant  estimate  of  ft  under  loss  l.ift.  a)  - \ia/ftt  - ll^  Is  derived 
it  is  shown  that  this  best  invariant  estimate  has  uniformly  smaller  mean- 
squared  error  than  the  best  linear  unbiased  estimate,  and  the  ratio  of  the 
mean-squared  errors  Is  estimated  from  simulation  experiments  A Bayesian 
formulation  of  the  estimation  problem  is  also  considered,  and  a class  of  Bayes 
estimates  is  explicitly  derived 


I.  INTRODUCTION 

The  bibliography  on  competitive  bidding  compiled  by  R.  M,  Stark  [12]  documents  the  fact 
that  there  has  been  considerable  recent  interest  in  mathematical  models  for  bidding  processes. 
For  the  most  part,  models  proposed  for  such  processes  have  been  used  for  developing  "optimal" 
bidding  strategies.  For  example,  Griesmer  and  Shubik  [4,  5,  6]  and  Griesmer,  Shubik,  and 
Levitan  |7|  formulate  bidding  processes  as  «-person  games,  and  obtain  equilibrium  solutions 
under  a variety  of  constraints  on  the  amount  of  information  on  previous  bids  available  to  each 
bidder.  The  only  distinction  among  bidding  processes  made  in  our  study  is  between  sealed-bid 
and  open-bid  auctions.  In  a sealed-bid  auction,  each  participant  submits  a bid  without  knowing 
the  bids  of  his  opponents,  while  in  an  open-bid  or  progressive  auction,  bids  are  made  sequen- 
tially and  all  participants  are  fully  aware  of  all  previous  bids.  Models  for  open-bid  auctions  are 
considered  in  |6j.  There  seems  to  be  little  work  on  probability  models  for  an  observed 
sequence  of  bids,  and,  to  our  knowledge,  inference  questions  concerning  open-bid  auctions  are 
essentially  untouched.  We  present  in  this  paper  a probability  model  for  an  open-bid  auction, 
and  we  develop  and  compare  .several  estimators  of  the  parameter  of  the  model. 

We  will  consider  an  open-bid  auction  model  to  be  any  probability  model  which  generates  a 
sequence  of  nonnegative,  increasing  observations  (T,  ].  Thus,  the  joint  distribution  of  the  order 
statistics  from  any  continuous  distribution  with  support  in  IR*  could  be  viewed  as  an  auction 
model.  .Similarly,  the  random-record  models  studied  by  I).  CJraver  [3]  and  by  M.  Yang  [15] 
could  serve  as  auction  models.  It  is  clear  that  a definitive  treatment  of  modeling  and  inference 

*fhc  sciond  iiuihor  is  now  <i(  Crlor.ido  SUilc  Univcrsilv 
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questions  for  open-bid  auctions  must  deal  with  the  stopping  rule  in  the  bidding  process  that 
determines  the  size  of  the  sample  of  bids  to  be  examined.  Much  can  be  learned,  however,  by 
examining  the  estimation  problem  as  a fixed-sample-size  problem,  and  Ihis  viewpoini  is  pursued 
in  section  3.  An  application  is  described  in  which  such  a simpliftcation  seems  justified.  In  sec- 
tion 4,  we  present  a Bayesian  formulation  of  the  estimation  problem  in  which  the  optimal  esti- 
mator is  in  fact  independent  of  the  stopping  rule. 

2.  THE  UNIFORM  AUCTION 


We  will  refer  to  the  following  probability  model  as  the  uniform  auction.  Let  « be  a non- 
negative, real-valued  parameter  to  be  thought  of  as  the  value  of  the  object  being  auctioned. 
Let  .V|  ~ f/(0,  «],  that  is,  T , is  uniformly  distributed  on  the  interval  (0,  «).  For 

1=2 n,  let  X,  ~ UlX,  ti].  Taking  -v,,  = 0,  the  probability  density  of  the  vector  of 

bids  X is  given  by 


(2.1) 


/(.v„ 


.v„|«)  = 


ti  1 

n 


for  0 < ,V|  < . . . < .v„  < 9, 


0 otherwise. 


This  probability  model  seems  to  be  new  in  statistical  literature,  and  has  a number  of  interesting 
characteristics.  It  is  not  an  order-statistic  model,  and  it  differs  from  the  distribution  of  the 
order  statistics  from  t/lO,  W]  in  that  the  uniform  auction  typically  produces  a set  of  bids  tending 
much  more  to  the  right  in  the  interval  (0,  III  than  the  set  of  bids  from  the  uniform  order- 
statistic  model.  The  uniform  auction  has  an  interesting  statistical  property  that  renders  the  esti- 
mation of  9 a somewhat  delicate  problem.  One  can  easily  verify  by  the  Lehmann-Scheffe  tech- 
nique 18)  that  the  entire  sample  of  bids  (T| XJ  is  a minimal  sufficient  statistic  for  9. 

Thus,  in  contrast  with  many  bounded  or  semibounded  order-statistic  models,  reduction  by 
sufficiency  does  not  accomplish  a simplification  of  the  problem. 


The  probability  model  considered  in  this  paper  is  supported  by  a number  of  intuitive  con- 
siderations. As  discussed  in  section  3.3,  the  value  9 in  the  uniform  auction  is  a scale  parameter 
of  the  distribution  of  bids.  We  are  thus  able  to  develop  reasonable  estimators  of  9 that  are 
invariant  with  respect  to  scale  transformations  on  the  data.  Such  estimators  have  the  property 
that  the  estimated  values  are  equivalent  regardless  of  the  monetary  units  in  which  9 and  the 
bids  are  expressed.  Stated  informally,  our  estimate  of  9 should  not  depend  on  whether  the  bids 
are  in  dollars  or  in  pennies,  and  each  of  the  estimators  obtained  in  section  3 satisfies  our  intui- 
tion in  this  regard.  It  is  possible  to  arrive  at  the  uniform  auction  rather  than  other  scale- 
parameter  models  from  an  entropy  argument,  since  the  uniform  distribution  on  (.V,,  9]  maxim- 
izes the  entropy  among  all  distributions  on  the  interval.  Thus,  the  uniform  auction  is  a sequen- 
tial formulation  of  maximum  uncertainty  concerning  the  bidding  process. 


Precedents  exist  in  the  literature  on  bidding  processes  for  the  probability  model  con- 
sidered in  this  paper,  as  well  as  for  the  estimation  problem  we  investigate.  In  a celebrated 
paper,  Vickrey  |14|  studied  game-theoretic  aspects  of  competitive  bidding  in  a framework  in 
which  bids  were  modeled  as  uniform  random  variables.  He  postulated  that,  in  a sealed-bid  auc- 
tion, a random  participant  assesses  the  value  of  the  object  at  auction  according  to  a uniform  dis- 
tribution on  some  interval,  taken  as  (0,  I)  for  simplicity.  Under  a linear  utility  function,  the 
unique  equilibrium  strategy  is  for  each  bidder  to  bid  the  amount 


where  v is  the  value  the  /'"bidder  obtains  from  the  uniform  distribution  and  Vis  the  number 
of  bidders.  Thus,  when  bidders  employ  the  equilibrium  strategy  (which  is  also  Pareto  optimal). 


I 


I 

1 


I 
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each  bid  is  itself  a uniformly  distributed  random  variable.  In  an  appendix,  Vickrey  briefly 
discusses  one  possible  extension  to  open-bid  auctions  — the  uniform  order-statistic  model  We 
believe  that  such  an  extension  is  inappropriate,  since  the  distribution  is  not  derived  using  a 
sequence  of  conditional  distributions  based  on  past  bids  and  is  thus  not  patterned  after  a real 
open-bid  auction.  It  is  easy  to  see,  however,  that  the  uniform  auction  is  a natural  extension  to 
open-bid  auctions  of  Vickrey’s  sealed-bid  model.  Suppose  each  bidder  has  an  initial  value 
assessment  v , drawn  independently  from  a uniform  distribution  according  to  Vickrey’s  model 
If  a bidder  emerges  at  random  from  the  A'  participants,  and  he  submits  a bid  according  to  the 
alorementioned  equilibrium  strategy,  the  bid  is  uniformly  distributed.  After  the  first  and  each 
successive  bid,  the  bidders  reassess  the  value  of  the  object  and,  following  Vickrey’s  model, 
obtain  new  values  according  to  a uniform  distribution  on  an  interval  with  a different  lower 
bound.  Under  such  a mechanism  for  reappraisal,  the  resulting  probability  model  is  precisely 
the  uniform  auction.  The  interpretation  of  « as  the  value  of  the  object  is  consistent  with 
Vickrey’s  scheme,  since  it  would  represent  the  least  upper  bound  on  possible  value  assess- 
ments, and,  in  theory,  one  could  find  a buyer  willing  to  pay  more  than  « - t for  the  object  for 
any  « > 0.  An  estimation  problem  in  competitive  bidding  was  examined  by  Christenson  |l|. 
In  that  paper,  a multiparameler  probability  model  was  advanced  as  a distribution  of  an 
opponent’s  bids  in  a multistage  sealed-bid  auction.  Among  the  parameters  of  the  model  was 
the  value  of  the  object  to  the  bidder.  Bayesian  linear  regression  analysis  was  used  for  estimat- 
ing the  parameters  of  the  model. 

While  the  uniform  auction  can  be  viewed  as  a natural  extension  of  Vickrey’s  scheme  for 
value  assessment,  it  can  also  be  motivated  from  a quite  different  scenario.  Consider  an  auction 
with  a large  number  of  expert  bidders.  The  annual  bull  auction  at  Red  Bluff,  California,  would 
be  an  example  (see  Sosnick  111,  page  1310]  for  a description).  Each  bidder  knows  W,  the  value 
(perhaps  the  market  value  or  perhaps  the  break-even  point  in  a marketing  process  after  a pur- 
chase) of  the  object  at  auction.  Such  an  assumption  is  of  course  approximately  true,  at  best, 
but  it  should  be  sufficiently  close  to  reality  in  some  auctions  to  justify  the  use  of  our  model.  In 
his  summary  section,  Vickrey  (14,  page  28)  discusses  the  validity  of  his  model,  stating 

"...  these  conclusions  are  based  on  a model  in  which  a high 
degree  of  rationality  and  sophistication  is  imputed  to  the 
bidders;  nevertheless,  in  many  markets  the  frequency  of  the 
dealings  and  the  professional  characteristics  of  the  dealers  are 
such  as  to  make  such  an  assumption  not  too  far  from  reality." 

(iiven  that  i bids  have  been  made,  the  (/  -t-  I)''  bid  is  assumed  uniformly  distributed.  This 
assumption  might  be  justified  on  behavioral  grounds,  since  the  bid  that  an  aggressive  bidder 
would  consider  would  be  high  in  (T,.  t)|,  while  the  bid  that  a conservative  bidder  would  con- 
sider would  be  low  in  |.V,,  Wl,  and  bence  the  random  bid  that  emerges  from  a large  group  of 
bidders  might  quite  reasonably  be  modeled  as  a uniform  variable.  Finally,  while  the  sample 
si/e  in  an  open-bid  auction  tends  to  provide  information  about  «,  one  might  consider  estimating 
n after  stopping  artifically,  that  is,  after  ti  bids  are  made,  even  though  the  bidding  actually  con- 
tinues. Estimating  II  from  an  artifically  stopped  auction  could  appropriately  be  considered  a 
ftxed-sample-si/e  problem  when  h is  not  too  large  relative  to  the  expected  number  of  bids.  In 
fact,  the  primary  application  of  the  estimation  procedures  discussed  in  the  next  two  sections 
might  well  be  the  estimation  of  value  by  a nonprofessional  (referred  to  henceforth  as  the  statis- 
tician) based  on  the  early  bids  of  expert  bidders. 

We  now  turn  to  the  moment  structure  of  the  uniform  auction  that  comes  into  play  in 
estimating  «.  By  the  recursive  relations 
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EX,  - E\E(X,\X,.t)\ 
together  with  EX\  = W/2,  we  obtain 


0 + EX,. 


/ - 2.  3 n. 


Define 


EX,=  \ - — 9.  / = 1 , 2 n. 

r 


«.=  1 - 


/ = 1,  2 n. 


Similarly,  we  obtain  for  1 < < < « 


Define  for  1 < / < / < n 


cov(X,,  X)  = — ^ 9^. 

2'  '3'  T" 


2'  '3'  2" 


and  let  I be  the  ii  x n matrix  whose  //"'element  is  tr 

3.  ESTIMATING  THE  VALUE  « FOR  A FIXED  SAMPLE  SIZE 

We  now  treat  « as  a parameter  unknown  to  the  statistician.  We  will  compare  three 
approaches  for  estimating  9 based  on  a sample  of  n bids. 

3.1  Functions  of  the  Maximum  Likelihood  Estimate 

The  maximum  bid  X„  is  the  MLE  for  9.  Since  X,,  is  not  sufficient  for  9,  we  would  expect 
to  do  better  with  estimators  based  on  more  information.  We  will,  however,  consider  efficiency 
questions  at  the  end  of  this  section  for  two  estimators  based  on  the  MLE.  Let  denote  the 
unbiased  function  of  the  MLE  given  by 

(3.1) 

where  //,  is  defined  in  (2.3).  Recall  from  (2.5)  that 

(3.2)  <r„„  = 

The  variance  (or  mean-squared  error)  of  the  estimate  9^  may  be  shown  to  be 

<r  „„9  ^ 

(3.3)  var(«|)  - 

II, f 

Let  be  the  multiple  of  the  MLE  with  smallest  mean-squared  error.  One  may  easily  obtain 
that 


1 II, : + ) 

The  mean-squared  error  of  the  estimate  9 2 may  be  shown  to  be 


MSE{92)  - - 

II,,  + „„ 


i 


1 
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3.2.  The  Best  Linear  Unbiased  Estimate 


We  now  derive  the  best  linear  unbiased  estimate  of  9,  that  is,  the  estimate  of  the  form 

= a'X 

with  minimum  variance  under  the  constraint 

a'£'X  = W. 

which  may  be  rewritten  as 


(3.8) 


a 'u  = 1 , 


The  BLUE  derived  in  this  paragraph  is  an  improvement  over  since  «|  is  itself  a linear 
unbiased  estimate  of  9.  The  relative  efficiency  of  9j  with  respect  to  the  BLUE  is  di.scussed  at 
the  end  of  section  3.  The  variance  of  the  estimator  is  given  by 


(3.9) 


var(Wj  = 9^a  'Za 


which  is  to  be  minimized  over  vectors  a satisfying  (3.8).  Using  straight-forward  Lagrange 
minimization,  we  have  by  differentiation 

2W^Ia  = \u. 


(3.10) 

Since  a must  satisfy  (3.8),  we  may  identify  X as 


(3.11) 


X = 


29^ 


u'^Z  'u 


so  that  the  BLUE,  which  we  denote  by  9y,  is  given  by 


(3.12) 

The  variance  of  Wj  is  given  by 

(3.13) 


- ^ u^I  'X 
^ u'Z  'u 


var(f)i)  = 


II  'Z-'h 


3.3  A Best  Invariant  Estimate 

We  note  that  « is  a scale  parameter  of  the  distribution  of  X,  that  is,  the  density  / given  in 
(2.1)  may  be  written  in  the  form 


/(.V, 


, .>r„|«)  = 


i 

n 

.V| 

9 

■ ■ 9 

.^1  X2  X„ 

for0<  — < — <...<  — <1.  We  develop  in  the  paragraphs  that  follow  a Pitman-type 
a H 

estimate  of  w in  a context  to  be  explained  shortly.  We  use  the  notation  and  approach  of  Fergu- 
son (2,  pp.  186-1901  in  this  development. 


We  propose  as  a loss  criterion  the  scale-invariant  loss  function 

... 

(3.14)  1.(9,  a) 


f-' 


With  this  loss  function,  the  decision  problem  of  estimating  9 is  invariant  under  the  group  of 
scale  transformations.  Invariant  decision  rules  are  those  rules  ./which  satisfy  for  all  c € IR'^ 


(3.15) 


./(t-Ti.  cX2 tX„)  ^ cd{X, X„). 
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^^•16)  Y = for  / = 2 n. 

The  disiribution  of  ( Kj,  , Y„)  does  not  depend  on  ti.  We  will  find  the  best  invariant  esti- 
mate of  ft  in  the  conditional  problem  based  on  /(Til  Y 2.  • • . Y„,  «),  where  « is  still  a scale 
parameter.  We  are  able  to  establish  by  the  usual  argument  that  this  rule  is  best  invariant  in  the 
original  problem  based  on  /(T| T„|«).  An  invariant  estimate  in  the  conditional  problem 


where  /j„  = is  such  lhal 

(3.17)  E ^ - 1 Y 

is  minimized  by  b = b„. 

The  transformation  7':  (Ti.  X2 XJ  — (Ti,  Y2. 

(3.18)  \J\^X'{\ 
and  thus 


= Ti' 


Y,,}  has  Jacobian 


(3.19)  /(a'i,  ^2 = 


0 


for  1 < y.  < . . . < v„  < 


and  0 < A | < — 


otherwise, 


where  we  take  Vo  = 0 and  v>|  = 1.  Since  the  conditional  density  /(jfijy,  0)  does  not  come  into 
play  in  our  calculations,  it  will  not  be  derived.  Since  the  risk  function  (3.17)  of  each  invariant 
rule  is  a constant  independent  of  «,  we  will  take  « = 1 for  convenience.  The  conditional  risk  at 
« = 1 for  the  rule  r/ylT)  = bTi  is 

J ItrrJjy'  ... 


/(.Vj .V,,) 


n 1 

n (1  'tt) 


where  .V|  has  been  replaced  by  the  dummy  variable  y.  Minimizing  (3.20)  with  respect  to  /? 
yields 


II  1 

n (i  - .ft) 


/ — ./F 

Ji)  II  I 

n (1  - .f,.f) 

1-1 


We  now  proceed  with  the  evaluation  of  t^i). 


Consider  the  numerator  in  (3.21).  The  decomposition  of  the  integrand  into  partial  frac- 
tions yields  the  following  representation: 


r ''‘.I 


— * . r- 
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where  .‘I  | 4„  | are  Ihe  solutions  of  the  system  of  linear  equations 


(3.23) 

where 

(3.24) 


I 


''I  2 I "//  2 2 


1 

-4i 

1 

....  "1,.  1 

2 

0 

= 

■ • • • ‘'ll  2 ,1  \ 

'^ii  1 

. 

• 

0 

I $ A I < ■ < A < rt  - I 


n va, 

r-i 


^ ! (or  all  r 

The  elements  (w„|/  = 1 n ~ 2]  in  the  column  of  the  (/;  - 1)  x (n  — 1)  matrix  in 

(3.23)  are  simply  the  elementary  symmetric  functions  in  the  letters  y, y,  ,, 

.EfU y„  |.  We  evaluate  the  integrals  in  (3.22)  by  repeated  applications  of  the  recursive 

formula 


f “ IT  " T r 

/)  4-  nr  nni  h 


(3.25) 

a -t-  hx  hm 

We  thus  obtain  for  any  positive  integer  A 


a + hx 


ilx. 


(3 


.26) 

1 - 


I 

J_  i 
.V,  A 

A 

- I 

/ -I 


v,v 


(iv 


I /•  l/l 

- / ' 
V •'0 


,A  I 


A - / + 1 


I - y,y 

A ) M 


(iy 


- /' 

y, 


(/y 


1 - y,y 


1 

1 . 

X 

A / ♦ 1 

A 1 1 

= — 

K 

I 

J_] 

1 

A - / -f-  1 

In 

V, 

1 - — 

/-I 

y, 

y„ 

y, 

y„ 

The  integral  above  can  be  indexed  by  Ihe  upper  limit  of  integration,  y„  the  exponent  of  Ihe 
numerator.  A,  and  the  bid  ratio  involved,  y,.  We  thus  denote  the  expression  in  (3.26)  by 
/(>'„  k,  y,).  Recalling  (3.21)  and  (3.22),  we  may  write  the  best  invariant  estimate  of  « in  Ihe 
conditional  problem  as 


(3.27) 


.4,/(y„  y,) 

■ ■1 

X ■4Hy„  « + 1 , y,) 


Ti. 


The  optimality  of  0^  among  invariant  estimates  in  the  unconditional  problem  may  be  esta- 
blished by  Ihe  usual  argument  (see,  for  example,  [2,  p.  1871). 

3.4  \ Comparison  of  i - 1 , 2,  3,  4 

It  is  easy  to  demonstrate  Ihe  inadmissibility  of  «|,  and  under  any  loss  function  of 
Ihe  form 
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/,(«,</)  = wiinui  - (/)■’, 


which  of  course  includes  squared  error  loss  as  well  as  ihe  loss  function  of  (3.14).  fhe  risk 
function  of  each  of  these  estimators  is  dominated  (uniformly  in  D)  by  the  risk  function  of  /),. 
One  may  argue  as  follows.  The  estimator  M4  is  uniquely  best  invariant  (up  to  equivalence) 
under  the  loss  criterion  (3.14).  Moreover,  W , for  / = I,  2,  3,  are  scale  invariant  estimators  of 
H.  Thus,  for  all  W, 


h 


(3.28) 

for  ( = 1,  2.  3.  Multiplying  both  sides  of  (3.28)  by  m (P)ii’  establishes  the  claim. 


It  remains  to  examine  the  efficiency  o(  the  estimates  >1  . / = i,  2,  3,  among  each  other  as 
well  as  relative  to  In  other  words,  we  week  to  answer  the  question:  "Mow  much  is  lost  by 
using  an  alternative  to  the  very  imposing  estimator  ii^?"  In  Table  I,  we  present  exact  relative 
efficiencies  for  H|,  with  respect  to  W,,  that  is,  we  list  the  ratios  of  their  respective  mean- 
squared  errors.  It  is  interesting  to  note  that  i),  is  to  be  preferred  to  n,  for  a sample  of  two  bids, 
even  through  it  rapidly  loses  ground  as  the  sample  site  increases. 


TABLE 


/; 

RllCu/n^) 

“2“ 

0.%4“' 

1 .048 

3 

0.917 

0.943 

4 

0.868 

0.876 

5 

0.824 

0.826 

6 

o.ni 

0.788 

7 

0.758 

0.758 

8 

0.735 

0.735 

9 

0.718 

0.718 

10 

0.705 

0.705 

The  mean-squared  error  of  W4  is  difficult  to  obtain  analytically.  We  have  generated  500 
samples  of  sizes  2,  3,  and  4 from  the  uniform  auction  with  H = 10.  MSE(«|)  and  MSEIh,) 
were  simultaneously  estimated  from  these  samples,  and  the  estimated  efficiency  of  »,  relative  to 
«4  appears  in  Table  2. 


TABLE  2 


It  seems  reasonable  to  conjecture  even  from  Ibis  very  small  simulation  study  that  the 
BLUE  perlorms  adequately  for  samples  of  size  n > 3 and  clearly  provides  substantial 
simplification  in  the  computations  required.  Combining  Tables  1 and  2 for  n - 2,  one  may 
estimate  as  0.966. 


4.  A BAVESIAN  FORMULATION  OF  TIIF  ESTIMATION  PROBI.EM 


In  the  last  section,  estimators  of  t»  were  developed  based  on  a sample  of  fixed  size  n from 
the  uniform  auction.  While  there  are  situations  in  which  one  might  reasonably  ignore  the  stop- 


f ; 

I ' 
. ( 


. ^ 

2 

2 

- 1 

< i: 

0 

1 

H 

II 

1 

• 

3 

0.952 

4 

0.981 
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ping  mechanism  in  an  auclion,  a general  ireaimeni  ol  the  esiimaiion  problem  should  allow  lor 
the  inlluence  ol  random  slopping.  The  esiimaiion  problem  is  perhaps  besi  lormulaled  as  a 
sequenlial  decision  problem,  a I'ormulalion  in  which  ihe  slopping  rule  is  dealt  wiih  explicitly. 
We  examine  such  a lornuilalion  in  this  section  Set|uenlial  decision  problems  are  discussed  in 
detail  by  J erguson  |2,  C hapter  7| 

A stopping  rule  in  a setjueniial  decision  problem  is  a sequence  of  functions 

'l>(x)  = ((/>„,  t/jilvj,  V,).  ...|,  where  x,,)  represents  the  probability  that 

sampling  is  terminated  after  n observations,  given  that  .Vi  = x | V„  = x„  were  observed. 

A terminal  decision  rule  is  a sequence  of  functions  </(x)  = If/,,.  (/,( v,),  i/jl  x |.  v,).  . . .1,  where 

(/„  represents  the  decision  rule  to  be  employed  il  sampling  stops  after  ii  observations.  In  many 
sequential  problems,  the  statistician  is  able  to  select  'l>  as  well  as  i/,  so  that  the  focus  is  on 
choosing  the  pair  ('l>.  </)  optimally.  The  auction  selling  we  study  here  is  such  that  'l>  is  not 
completely  controllable  (for  example,  bidding  may  slop  after  one  high  bid)  and  is  generally 
unknown.  We  therefore  are  primarily  interested  in  the  selection  of  optimal  estimators  for  given 
fixed  stopping  rules. 

It  is  well  known  that  the  Bayesian  approach  to  sequential  estimation  results  in  estimators 
that  are  optimal  (in  the  Bayes  sense)  regardless  of  what  stopping  rule  is  in  force,  providing  the 
stopping  rule  does  not  depend  on  U.  A preci.se  statement  of  this  result  may  be  found  in  Fergu- 
son (2,  page  3141  Thus,  this  approach  is  well  suited  to  situations  in  which  the  stopping  rule  is 
either  unknown  or  beyond  the  control  of  the  statistician.  The  more  general  (and  substantially 
more  difficult)  sequential  problem  with  stopping  rules  which  may  depend  on  0 will  not  be 
treated  here.  We  develop  below  a class  ol  Bayes  estimators  relative  to  Pareto  prior  distributions 
on  H and  squared  error  loss  based  on  a sample  of  n bids  from  the  uniform  auction.  The 
sequential  decision  procedure  which  minimizes  the  Bayes  risk  for  any  given  stopping  rule  <1>  is 
the  procedure  which  uses  the  Bayes  estimator  for  the  sample  that  becomes  available,  regardless 
ol  size. 


(4.1) 


Ihe  Pareto  density  with  parameters  / and  «,!  is  given  by 

if  0 35  « 


niH\r.  ft,,) 


t 1 II  S5'  »'(), 

0 otherwise. 


where  fi„  > 0 and  r > 0.  If  a random  variable  ft  has  a Pareto  distribution  with  /■  > 1,  then 

/■0„ 

f;n  = !L. 


and  if  r > 2, 


var((()  = 


/•«,? 


(/  - 1)^(/'  - 2) 


We  derive  in  this  section  an  expression  for  the  Bayes  estimate  of  ft  with  respect  to  Pareto  prior 
distributions.  The  expression  (4.2)  involves  definite  integrals  that  are  difficult  to  evaluate  in 
general.  We  evaluate  the  Bayes  estimate  only  for  positive  integer  values  of  the  Pareto  parame- 
ter r.  but  indicate  a simple  transformation  which  is  helpful  in  evaluating  the  estimate  for  posi- 
tive rational  r. 


Ihe  density  function  of  the  vector  of  bids  in  the  uniform  auction  is  given  in  (2,1).  Thus, 
taking  ft  as  a Pareto  random  variable  with  parameters  /•  and  we  obtain  a posterior  density 
proportional  to 


I- 

I 


630 


(■  J SAMANIEGO  AND  L O KAISER 
1 'Vnl  1 


n 


I 


for  0 < .V|  < ...  < .v„  < H,  and  « > Lolling  w,,  = max(.v„,  O,,),  we  may  write  the  Bayes 
eslimale  oC  H tor  squared  error  loss  as 


(4.2) 


ft.  = 


r 'W  _ i_ 

(L  *'  ‘ H - .V 


(Hi 


n 1 

n 


""  ft  - .V 


- do 


The  evaluation  of  ffs  involves  steps  quite  similar  to  those  followed  in  evaluating  ft 4.  We  thus 
include  only  those  details  needed  in  clarifying  the  notation  used  in  our  final  expression  for  ()<,. 
We  first  obtain  a partial  fraction  decomposition. 


(4.3) 


II  I 

n 


0 - .V 


B 

0 - .V, 


where  /i, B,,  1 are  the  solutions  of  the  system  of  linear  equations 


(4.4) 


with 

(4.5)  ^ = Z 

KA,<  1 

Tor  ;ill  r 

We  then  have  the  representation  for  the  numerator  of  (4.2); 

I I f)  1 

I • n 

(4.6) 


1 1 ....  ' 

«l 

() 

('ll  1 2 ....  6 1 „ 1 

1^2 

0 

(’ll  2 1 (’ll  2 2 (’ll  2.11  1 

B„  1 

1 

n \ 


Integrals  of  the  type  in  (4.6)  were  studied  by  Chebyshev,  whose  famous  result  implies  that 
such  integrals  are  elementary  functions  whenever  r is  a rational  number  (see  Riti  110)).  We 
develop  here  the  exact  expression  for  0^  only  for  positive  integers  r ; if  r is  the  ratio  of  integers 
v/r,  however,  the  transformation  ft  = A'  reduces  integrals  in  (4.6)  to  integrals  of  rational  func- 
tions of  A.  The  integral  of  a rational  function  can  always  be  obtained  in  finite  terms  by  the 
method  of  partial  functions. 


(4.7) 


Using  the  recursive  formula 
do 
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valid  for  m > 1,  we  obtain  for  any  positive  integer  m 
r”  do 
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Wc  may  thus  express  ihe  Hayes  rule  relative  to  the  Pareto  prior  with  parameters  i € and  W,, 
as 


X «r(A„  /■  + I) 

(4»)  v\  

X HCix,.  r + 2) 

- - 1 

and  a similar  expression  may  be  developed  when  / is  a positive  rational. 

We  close  this  section  with  two  remarks.  I'irst,  we  note  that  Bayes  estimates  for  the  Pareto 
prior  and  certain  weighted,  squared-error  loss  lunctions  can  be  developed  in  a similar  fashion. 
In  particular,  the  Bayes  estimate  relative  to  the  Pareto  prior,  with  parameters  /■  € / * and  W,,  and 
the  loss  function  of  (3.14),  is  given  by 

,1  I 

X « ( ( /■  -I-  3) 


X « r(.v,,  /•  + 4) 


Secondly,  for  positive  integers  /,  one  may  obtain  an  interesting  class  of  limits  of  Bayes  rules  as 
lilt  0.  The  lorm  ol  these  rules  is  easily  obtained  — one  simply  replaces  w,,  in  the  expression 
lor  n<,  or  W,,  by  .v„.  One  may  show  that  these  limits  of  Bayes  rules  are  invariant  under  scale 
translormations,  indicating  that  the  invariant  procedures  studied  in  section  3 are  similar  to 
Bayes  procedures  for  suitable  choices  of  Pareto  parameters. 
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ABSTRACT 

A model  is  developed  which  may  be  used  to  determine  the  expected  total  > 

cost  ol  quality  control  per  inspection  lol  under  acceptance  sampling  by  variables 

where  several  characteristics  arc  to  he  simultaneously  controlled  Optimization  | 

of  the  model  is  accomplished  through  the  application  of  a conventional  search 

procedure.  The  sensitivity  of  the  model  and  the  optimum  solution  to  the  shape  i 

of  the  underlying  probability  distributions  is  discussed  and  associated  analyses  I 

arc  presented  through  an  example.  i 


INTRODUCTION 

C3vRr  the  past  decade  considerable  attention  has  been  devoted  to  the  subject  of  cost-based 

quality-control  systems.  Hald  1 1 21  and  Horsnell  1 15]  have  developed  and  analyzed  mathemati- 

cal cost  models  for  acceptance  sampling  for  a single  attribute  while  Duncan  15,  71,  Gibra  llOl, 
and  Goel,  et  al  , H 11  have  carried  out  similar  research  with  respect  to  process  control  of  a sin- 
gle variable.  Relatively  little  attention  has  been  given  to  the  problem  of  simultaneous  control 
of  several  attributes  or  variables.  Schmidt  and  Bennett  1291  have  developed  a cost-based  model 
for  multi-attribute  acceptance  sampling.  Ghare  and  Torgersen  l9l  have  examined  the  simul- 
taneous control  of  several  variables  where  the  basis  for  control  is  purely  statistical. 

Montgomery  and  Klatt  1201  have  extended  the  Hotelling  7'^  Model  to  include  economic  con- 
siderations. However,  the  quality-control  system  resulting  from  the  model  by  Montgomery  and 
f Klatt  identifies  an  out-of-conlrol  situation  but  does  not  specify  the  variable  or  variables  causing 

this  condition.  Latimer,  Bennett,  and  Schmidt  1181  have  presented  a model  for  multivariate 
process  control.  However,  an  analogous  model  for  cost-based  acceptance  sampling  is  not  avail- 
• able.  This  is  the  topic  of  this  paper. 

THK  SYSTEM  MODELED 

I 

. The  problem  addressed  in  this  paper  is  that  of  multivariate  acceptance  sampling  where  the 

quality  characteristics  of  concern  are  independently  distributed  random  variables.  The 
mathematical  model  developed  measures  the  expected  total  cost  of  quality  control  per 
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inspection  lot  where  the  sample  mean  for  each  characteristic  is  the  statistic  used  to  assess  the 
acceptability  of  the  lot  with  respect  to  that  characteristic.  The  decision  variables  included  in  the 
model  are  the  sample  size  and  lower  and  upper  acceptance  limits  for  the  sample  mean  for  each 
characteristic.  The  model  applies  to  the  case  where  inspection  may  be  destructive  or  nondes- 
tructive; rejection  of  the  inspection  lot  on  some  characteristics  results  in  scrapping  of  the  lot, 
while  screening  results  when  rejection  occurs  on  others.  The  model  developed  lakes  account  of 
the  costs  associated  with  inspection,  with  screening  and  repair  or  replacement  of  defective  items 
when  the  lot  is  rejected  and  screened,  with  scrapping  the  lot  when  the  lot  is  rejected  and 
scrapped,  and  with  defects  present  in  accepted  and  screened  lots. 

The  mode)  presented  here  is  appropriate  in  cases  where  the  acceptability  of 
manufactured-product  lots  is  dependent  upon  the  lot  mean  values  of  several  measurable  charac- 
teristics. The  lot  means  for  the  several  characteristics  are  assumed  to  be  independently  distri- 
buted random  variables,  and  the  mechanism  used  to  assess  lot  acceptability  is  a variables 
quality-control  system  for  the  lot  mean  of  each  characteristic.  The  problem  is  a familiar  one  in 
the  manufacturing  industry,  although  the  characteristic  ,ot  means  are  frequently  correlated. 
However,  cases  where  these  characteristics  are  independently  distributed  are  not  uncommon,  as 
illustrated  by  the  following  example. 

Consider  a process  for  the  production  of  collap.sible  metal  tubes,  such  as  tooth-paste 
lubes.  These  tubes  are  manufactured  on  an  automated  line  in  lots  of  approximately  14,000, 
Four  characteristics  are  of  particular  concern:  tube  diameter,  tube  length,  neck  length,  and  cap 
torque.  Tube  diameter  is  controlled  by  an  extrusion  process  which  results  in  a formed  tube 
about  1 in.  longer  than  required.  The  end  of  the  tube  is  then  cut  off,  the  final  length  being 
controlled  by  measurement  from  the  neck  of  the  tube.  Plastic  necks  are  then  automatically 
inserted  in  the  top  of  the  tube;  and  the  projection  of  the  neck  is  controlled  by  the  insertion  pro- 
cess, The  final  operation  is  capping  the  lube,  and  the  torque  required  to  remove  the  cap  is 
dependent  upon  proper  adjustment  of  the  canping  operation.  Analyses  to  date  have  indicated 
that  there  is  no  significant  correlation  among  these  variables  for  the  present  process,  although 
the  process  used  previously  resulted  in  slight  correlation  between  tube  diameter  and  length. 
The  latter  condition  resulted  when  the  extrusion  process  produced  lubes  which  were  shorter 
than  the  desired  final  length.  However,  this  failure  has  not  occured  on  the  present  process. 

The  model  may  appproximate  reality  even  when  mild  correlation  exists  among  the  charac- 
teristics of  concern.  However,  this  conjecture  cannot  be  verified  at  this  time,  since  cost  models 
for  the  case  of  correlated  characteristics  have  not  been  addressed  in  the  literature.  Hence,  the 
model  developed  here  represents  only  a first  step  in  the  analysis  of  multivariate  acceptance  sam- 
pling for  quality  control. 

The  mode)  assumes  that  the  characteristics  of  concern  may  be  classified  into  one  of  three 
mutually  exclusive  categories.  The  first  category,  characteristic  class  1,  includes  those  variables 
for  which  inspection  is  destructive  to  the  extent  that  the  product  is  rendered  unfit  for  its 
intended  use  and  for  further  inspection  on  other  characteristics.  The  second  category,  charac- 
teristic class  II,  includes  those  variables  for  which  inspection  is  destructive  in  the  sense  that  the 
product  cannot  be  used  for  its  intended  purpose  once  inspection  on  these  characteristics  takes 
place.  However,  inspection  on  characteristics  in  class  II  does  not  render  the  product  unfit  for 
further  inspection.  The  third  category,  characteristic  class  III,  includes  those  variables  for 
which  inspection  is  nondestructive.  As  the  above  discussion  implies,  rejection  of  a lot  on  vari- 
ables falling  into  classes  I or  II  results  in  scrapping  of  the  inspection  lol,  while  rejection  on  vari- 
ables in  class  III  only  leads  to  screening  of  the  inspection  lot. 
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Let  n,  < 


"2 


^ Hi  be  the  sample  sizes  for  variables  1 through  A.i  in  class  I, 


"*i.i  ^ "H|t2  ^ ^ the  sample  sizes  for  variables  A,  + 1 through  A2  in  class  II,  and 

the  sample  sizes  for  variables  A2  + I through  Aj  in  class  III.  The  inspection 
procedure  is  sequential  and  can  be  summarized  by  the  decision  tree  in  Figure  1,  where  the 
notation  /|A,  /,  indicates  inspection  of  A units  on  characteristic  / and  the  A units 

inspected  are  drawn  from  set  .S],  which  includes  /V  units. 


The  costs  arising  from  the  implementation  of  a multivariate  acceptance  sampling  plan  may 
be  considered  to  fall  into  two  categories,  the  costs  incurred  in  reaching  a decision  to  accept  or 
reject  the  inspection  lot,  and  those  costs  arising  from  implementation  of  the  decision  to  accept 
or  reject  the  lot.  The  cost  of  reaching  the  decision  to  accept  or  reject  a lot  is  the  cost  of 
inspecting  the  units  included  in  the  random  sample  drawn  from  the  inspection  lot.  Specifically, 
if  C),  is  the  cost  of  inspecting  one  unit  for  the  /th  characteristic,  and  the  number  of  unit.s 
inspected  is  n, , then  the  cost  of  inspection  for  the  i th  variable  is  C),  n,. 

When  an  inspection  lot  is  accepted,  .several  cost  components  must  be  considered.  First, 
those  units  destroyed  in  the  inspection  process  result  in  a loss  of  product  value  and  a 
materials-handling  cost,  and  they  may  be  replaced,  resulting  in  an  additional  cost  of  replacing 
the  destroyed  units.  Secondly,  those  units  in  the  sample  which  are  not  destroyed  during  inspec- 
tion but  are  found  to  be  defective  as  the  result  of  nondestructive  inspection  may  be  repaired. 
Finally,  one  cannot  a.ssume  that  an  accepted  inspection  lot  is  defect  free.  Thus,  it  is  assumed 
that  a cost  is  associated  with  each  characteristic  defect  present  in  an  accepted  inspection  lot. 
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If  the  inspection  lot  is  rejected  and  scrapped,  the  cost  incurred  is  that  resulting  from  loss 
the  investment  in  the  lot  as  well  as  the  cost  of  materials  handling  and  carrying  out  the  scrapping 
operation.  When  the  inspection  lot  is  rejected  and  screened,  the  resulting  costs  arc  those  of 
screening  the  units  included  in  the  uninspected  portion  of  the  lot  and  in  the  nondestroyed  por- 
tion of  the  sample  on  the  variable  or  variables  for  which  screening  is  called,  the  cost  of  repair- 
ing defective  units  found  during  screening,  the  cost  of  discarding  and  possibly  replacing  des- 
troyed units,  and  the  cost  of  units  which  are  defective  with  respect  to  those  variables  for  which 
lot  acceptance  occurred. 

The  random  variables  considered  in  the  system  are  the  lot  mean,  the  sample  mean,  and 
the  individual  unit  dimension  for  each  variable  of  concern.  If  .S,,  and  .5^,  are  the  lower  and 
upper  specification  limits  for  the  ilh  variable,  and  if  X,  is  the  measured  value  of  that  variable 
for  a given  unit  of  product,  then  that  unit  of  product  is  considered  defective  with  respect  to  the 
/th  variable  if  X,  < or  X,  > Su,.  Otherwise  the  product  is  acceptable  with  respect  to  the 
/Ih  variable.  Acceptance  or  rejection  of  an  inspection  lot  with  respect  to  a given  variable  is 
determined  by  the  value  of  sample  mean  for  that  variable  and  the  associated  acceptance  limits 
for  the  sample  mean.  Specifically,  if  LCL^  and  UCL,  are  the  lower  and  upper  acceptance  limits 
for  the  sample  mean  for  the  / th  variable,  and  if  X,  is  the  value  of  the  sample  mean  for  that 
variable,  _then  the  inspection  lot  is  accepted  with  respect  to  the  ith  variable  if 
LCL,  < A’,  < UCL,  and  is  rejected  otherwise. 

The  purpose  of  the  multivariate  quality-control  system  modeled  here  is  to  detect,  isolate, 
and,  in  the  case  of  screening,  rectify  those  inspection  lots  where  the  lot  mean  for  one  or  more 
variables  significantly  deviates  from  its  desired  value.  Since  one  cannot  normally  predict  the 
value  of  the  lot  mean  for  a given  variable  with  certainty,  the  lot  mean  is  considered  to  be  a ran- 
dom variable,  as  indicated  above. 

The  model  developed  in  this  paper  represents  the  expected  total  cost  of  quality  control  per 
inspection  lot  and  is  a function  of  the  decision  variables  mentioned  above.  The  values  of  the 
decision  variables  which  minimize  the  expected  total  cost  of  quality  control  per  lot  are  deter- 
mined through  the  use  of  a conventional  search  technique,  the  pattern  search  [42]. 

Additional  Notation 

L = the  number  of  items  in  an  inspection  lot. 

(ju,,  (t})  = mean  and  variance  of  product  dimension  for  the  / th  variable  in 

an  inspection  lot. 

M = the  vector  (mi,  M2 

/ (/X,)  — probability  density  function  of  . 

“ conditional  probability  density  function  of  X,  for  samples  drawn 
from  an  inspection  lot  with  mean  /x, . 

^(•x;|/x,)  - conditional  probability  density  function  of  X,  for  units  drawn  from 

an  inspection  lot  with  mean  /x, . 

h(x,\n,)  dx,. 

1m,)  “ f.  h(x,\ti,)  dx,. 

P(G,\„,)  - 1 - Pi  DM  - Pi  DM 
PiA,\n,)  - ' 2f(x;|/x,)  dx,. 
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('.(  (accept, 

c;,.  c; 

(scrap, 

C'«(/I) 

c;, 

^ tr  / ( ■ ^ u th 

rs(M) 

c, 


= expected  cost  of  sampling  inspection  given  the  vector 
= cost  which  results  from  the  presence  of  the  / th  variable  below  the 
lower  and  above  the  upper  specification  limit  in  an  accepted  lot. 

= expected  cost  of  accepting  the  inspection  lot  given  acceptance 
and  the  vector  Ji. 

= expected  cost  of  accepting  the  in.spection  lot  given  the  vector  Ji. 

= unit  cost  of  production  and  materials  handling  respectively. 

= expected  cost  of  scrapping  the  inspection  lot  given  rejection  and  the 
vector  /2. 

= expected  cost  of  scrapping  the  inspection  lot  given  the  vector  /I. 

= unit  cost  of  performing  the  screening  inspection  for  the  i th 
variable. 

= cost  of  repairing  one  unit  of  product  found  to  have  the  / th 

variable  below  the  lower  and  above  the  upper  specification  limits  respectively. 
= expected  cost  of  screening  the  inspection  lot  given  the  vector  jl. 

= total  expected  cost  of  quality  control  per  inspection  lot  submitted 
for  control. 

= total  number  of  items  removed  from  the  inspection  lot  for  the  purpose  of 
sampling  inspection. 


max  5;  n,  + X «,  + "<3 


= min{22,  — 21, 

1.  ",  - «Aj). 

2 = ^2  + 1.  •• 

1^2. 

“ max{2i,  — 2)j^ 

,01,  / “ /tj 

1+1,  ...  ky 

1.  > "‘2 

0,  22,^  < 


S|, 


1, 

0.  n,  < 2)*^ 


/ - /( 2 + 


^■3 


I,  units  destroyed  during  sampling  inspection  are  replaced  in  accepted  or 
screened  lots 

62-  -j 

• 0,  units  destroyed  during  sampling  inspection  are  not  replaced  in  accepted  or 

> \ screened  lots 


Additional  Assumptions 

I The  random  variables  Ml Ma  , are  independently  distributed,  as  are  T,  X.  and 

a;,. 

2.  Errors,  if  any,  which  occur  in  sampling  or  screening  inspection  are  assumed  to  be  negligi- 
ble. 

3.  A single  defect  type  can  occur  only  once  in  a unit  of  product. 


» 
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4. 


f 

A unit  that  has  been  rendered  unfit  for  its  intended  use  during  sampling  inspection  is 
scrapped,  but  it  may  or  may  not  be  replaced  if  the  inspection  lot  is  accepted  or  screened. 


5. 

I 


In  an  inspeciton  lot  that  has  not  been  scrapped,  a unit  that  has  not  been  destroyed  during 
sampling  inspection  but  has  been  found  defective  during  subsequent  sampling  or  screen- 
ing inspection  on  a nondestructible  variable  type  is  assumed  to  be  repaired. 


6.  A unit  that  has  been  accepted,  or  repaired,  but  has  multiple  defects  will  incur  the  cumula- 
tive cost  of  accepting,  or  repairing,  each  characteristic  type. 

• .. 

7.  If  the  inspection  lot  is  accepted  on  all  destructible  characteristic  types,  and  the  policy  is  to 
replace  all  units  destroyed  in  sampling  inspection,  replacement  units  are  screened  on  all 
nondestructible  characteristic  types  and  defectives  are  repaired  accordingly. 

8.  When  screening  for  multiple  characteristic  types,  the  inspection  of  any  one  unit  is  carried 
out  to  completion  even  though  a defect  in  the  unit  may  be  detected  prior  to  the  inspection 
of  all  characteristic  types  on  that  unit. 

9.  An  inspection  lot  which  is  accepted  on  all  destructible  characteristic  types  cannot  be 
scrapped.  If  the  inspection  lot  is  subsequently  rejected  on  one  or  more  screenable  charac- 
teristic types,  the  remainder  of  the  inspection  lot  is  screened  for  those  variable  types  only. 
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Inspection  Cost 


If  sampling  is  conducted  on  the  / th  variable,  the  cost  of  inspection  is  C/,n,.  For  variables 
in  classes  1 and  11,  / = 1,  2,  ...  , ki,  the  conditional  probability  that  inspection  is  carried  out, 

given  Jl,  is  unity  for  / = 1 and  H for  i = 2,  3,  ...  . /cj.  Inspection  is  carried  out  on 

/-I 

variables  in  class  III  only  if  the  inspection  lot  is  accepted  on  all  variables  in  classes  I and  II. 
Hence,  the  conditional  probability  that  inspection  is  carried  out  on  variables  in  cla.ss  III,  given 
*2 

M,  is  n 

/-I 


Thus 


(1) 

*2 

Cfijl)  - ^ 

C,,n,  'fi  /*(>!, Im,) 

+ 

^-3 

1-2 

;-l 

1 — k 2^- 1 

npiA,\pi,). 


(-1 


• - 

r 


I 


Acceptance  Cost 


The  first  cost  considered  when  an  inspection  lot  is  accepted  is  that  resulting  from  units 
destroyed  (classes  I and  II)  and  units  repaired  (class  III)  in  the  sample.  Thus,  the  conditional 
expected  cost  of  units  destroyed  or  repaired  given  lot  acceptance  and  jl  is  given  by 


multivariate  sampling  plans 


639 


(2)  E(i:ost  of  destroyed  and  repaired  units  | accept,  /I), 


(<;,  + (•,)  I n.  + n,  + c;  I 8„^„  1 - n 


It  is  assumed  that  units  destroyed  during  sampling  inspection  may  or  may  not  be  replaced 
if  the  inspection  lot  is  accepted.  The  policy  to  replace  or  not  to  replace  is  defined  by  82.  All 
replacement  units  are  screened,  and  repaired  where  necessary,  on  class  III  variables.  However, 
replacement  units  which  are  defective  with  respect  to  class  I or  II  variables  pass  undetected. 
Hence, 


(3)  Efreplacement  cost  [ accept,  /I)  - 82  '^n,  + Ic,, 


+ Z [C„  + C„u,P(Da,i^,)  + C„,,,P(D,J^,)] 


When  an  inspection  lot  is  accepted,  there  may  be  some  units  in  the  sample  which  have 
not  been  destroyed  and  which  have  not  been  inspected  on  some  of  the  variables  of  concern. 
Such  units  may  contain  defects  with  respect  to  those  variables  for  which  inspection  was  not 
conducted  and  the  associated  expected  cost,  given  acceptance  and  ]1,  is  given  by 

(4)  E(cost  of  defects  in  uninspected  and  nondestroyed  portion  of  the  sample  | accept,  ]1) 


The  final  cost  component  arising  when  an  inspection  lot  is  accepted  is  that  resulting  from 
the  presence  of  defective  units  in  the  uninspected  portion  of  the  lot,  which  is  comprised  of  L- 
N*  units.  iV*  is  the  total  number  of  items  removed  from  the  lot  for  the  purpose  of  sampling 
inspection.  Then 

(5)  E(cost  of  defects  in  the  uninspected  portion  of  the  lot  | accept,  /I) 


a - /V*)  t \c,u,P(Du,\^,)  + 


The  conditional  expected  total  cost  of  acceptance,  given  acceptance  and  is  then  given 
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(6)  C^Caccept,  fi)  - E(cost  of  destroyed  and  repaired  units  | acc'pl,  /T) 

+ E(replacement  cost  (accept,  /u) 

+ E(cost  of  defects  in  uninspected  and  nondestroyed  portion  of  the 
sample  1 accept,  /T) 

+ E(cost  of  defects  in  the  uninspected  portion  of  the  lot  | accept,  /I) 
and  the  conditional  expected  cost  of  acceptance,  given  jx,  is 

^ J 

“ C.,(accept,  m) 

<-i 


Scrapping  Cost 

An  inspection  lot  is  rejected  and  scrapped  only  if  the  lot  is  rejected  on  one  of  the  variables 
in  classes  I or  11.  The  conditional  expected  cost  of  scrapping,  given  jl,  is 

*2 
(-1 


(8) 


Q(m)  « L{C,+  C,) 


Screening  Cost 


An  inspection  lot  is  screened  whenever  it  is  accepted  on  all  variables  in  classes  I and  11 
but  is  rejected  on  one  or  more  variables  in  class  III.  Screening  inspection  then  occurs  only  for 
those  variables  in  class  111  for  which  rejection  occurs.  The  expected  cost  of  carrying  out  the 
screening  operation,  given  acceptance  on  class  I and  11  variables  and  /I,  is 


(9)  Elscreening  cost  | acceptance  on  1 and  II  variables,  jZ) 

~{L-N')  ^ CJl  - />(.4,|4,)1. 

I 1 

In  addition,  the  units  in  the  sample  which  were  not  destroyed  during  sampling  inspection  and 
which  were  not  inspected  on  the  rejected  variable  must  also  be  screened,  if  any  such  units  exist. 
Hence,  the  total  conditional  cost  of  the  screening  operation,  given  acceptance  on  class  I and  II 
variables  and  p,  including  the  quantity  given  in  (9),  is 


(10)  Eltotal  screening  cost  | acceptance  on  I and  II  variables,  jl) 


I c;, 

-*2+l 


L - /V*  + 8„l8|,(n*j  - n,)  + n,^  - H - R{A,\n,)\. 


As  in  the  case  of  lot  acceptance,  the  cost  of  units  destroyed  (classes  I and  ID  and  rejected  and 
repaired  (class  III)  during  inspection  must  be  accounted  for.  The  cost  of  units  destroyed 
(cla.s.ses  I and  II)  or  rejected  and  repaired  (class  III),  given  acceptance  on  variables  in  classes  I 
and  n and  is 


1^ 
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(11)  E(cost  of  destroyed  and  repaired  units  | acceptance  on  I and  11  variables,  /I) 


1 * 1 

[ 

- 

(C,,  + r,)  ^ n,  + riAj 

+ C 

Z 8i. 

I -1 

1 -Aj+I 

- n 


I 


1 - n 


Those  units  not  destroyed  in  the  sample  but  which  have  been  screened  and  rejected 
given  variable  must  be  repaired.  This  cost  is  given  by 

(12)  E(screening  and  repair  cost  for  the  sample  | acceptance  on  class  I and  II  variables,  Ji) 


= z 


~ tl  — ^((j,Imi)1  + — n)  + (1  — 8|,)  — n^^)] 


on  a 


[C„u,P(Dtj,\^i,)  + C„L,P(D^,\^L,)]  (1  - P(A,\^,) 


Since  the  screened  lot  is  eventually  treated  in  a fashion  similar  to  that  in  the  case  of  lot 
acceptance,  units  destroyed  in  the  sampling  inspection  may  or  may  not  be  replaced  and,  as  in 
the  case  of  lot  acceptance,  are  screened  on  class  III  variables  but  may  contain  undetected 
defects  with  respect  to  variables  in  classes  I and  II.  Hence 


(13)  E(replacement  cost  | acceptance  on  class  I and  II  variables,  jl) 


Z ",  + «A 


C,.  + Z [c.,  + C„a,P(Dtj,\^,)  + C,,l,P(D,Jh,)\ 

/-A,+l  * 1 


*3 

+ tlC,u,P(Du,\tJL,)  + C,,,P(D,J^,)\ 

1 - n 

/-*2+l 

In  addition  to  undetected  defects  contained  in  replacement  items,  undetected  defects  with 
respect  to  a given  variable  may  occur  in  units  which  have  neither  been  destroyed  nor  inspected 
on  that  variable.  This  component  of  cost  arising  from  screening  the  inspection  lot  can  be 
expressed  as 


(14)  E(undetected  defects  in  the  sample  | acceptance  on  class  1 and  II  variables,  jl) 


K Z 

<-A j+l 


8i/ 


Z c„u,P(Du,\n,)  + c;„,p («,.>,) 


1 - n 

(-A,+l 


+ 8|,("a,  - ",)  + (I  - 6,,)  (n*^  - rt*j)j 
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1 - n /’<'4,Im,) 


^2T 

1^1 


The  final  two  components  of  cost  associated  with  screening  an  inspection  lot  are  those 
relating  to  the  repair  or  replacement  of  defects  found  during  screening  of  the  uninspected  por- 
tion of  the  lot  and  the  occurence  of  undetected  defects  in  the  uninspected  portion  of  the  lot. 
The  conditional  cost  of  handling  and  repairing  defects  found  during  screening  inspection  of  the 
lot,  given  acceptance  on  class  I and  11  variables  and  ^ is 


(15)  E(retification  cost  for  the  screened  lot  | acceptance  on  class  1 and  II  variables,  /T) 


(L  - /V*)  £ 


C-,(l  - + C,„,P(D,,,W,) 


■ 11  - P{A,\ti)\. 

The  cost  of  accepted  defects  in  the  uninspected  portion  of  the  lot,  given  acceptnce  on  all  vari- 
ables in  classes  I and  II  and  /T,  is 

(16)  E(cost  of  undetected  defects  in  the  uninspected  portion  of  the  lot  | acceptance  on 
class  I and  II  variables,  fL)  = (L  — N*) 


^3 

i(Q:,/'(//i/,(M,)  + C,,_,P(DL,\,Ji,)] 

1 - n /’(-4,1m,) 

>-\ 

+ (L  - N') 

* 1 

Z (Cy,P(Dt„|M,)  + Q,,,P(D,,,|M,)l/’(-4,lM,)  • 

1 - n /’(4,1m,) 

1 2+ 1 

Summing  (8)  through  (16)  yields  the  conditional  expected  total  cost  resulting  from 
screening,  given  acceptance  on  class  I and  II  variables  and  jl,  Q (acceptance  on  class  I and  II 
variables,  jT)  and 

— 

(17)  C’v(m)  = C's (acceptance  on  class  I and  II  variables,  fl)  n/’('4,iM,). 

,-l 


The  total  expected  cost  of  quality  control  per  inspection  lot,  Cj,  is  then  given  by 


(18) 


lc,(fx)  + + Cv(m)1  ./'(m,)  </m. 


RESULTS 

Application  of  the  model  and  its  optimization  are  illustrated  in  the  following  example. 

EXAMPLE  1:  A control  device  is  composed  of  three  fuses,  each  of  which  controls  the 
flow  of  current  in  a different  circuit.  The  device  is  a component  of  a piece  of  electrical  equip- 
ment which  will  fail  if  either  too  much  or  too  little  current  flows  in  any  circuit,  resulting  in 
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repair  of  the  equipment  at  final  testing.  Inspection  of  the  control  device  consists  of  measuring 
the  current  at  which  each  fuse  burns  out.  Burn  out  of  fuses  1 or  2 normally  results  in  damage 
to  the  device  sufficient  to  render  inspection  on  any  other  variable  impractical.  However,  testing 
the  third  fuse  does  not  damage  the  device  sufficien.ly  to  eliminate  testing  on  the  other  vari- 
ables. A fourth  variable  inspected  is  the  diameter  of  the  device,  to  make  sure  it  will  fit  into  the 
final  assembly.  The  data  for  this  example  are  given  in  Table  1;  current  is  measured  in  milliam- 
peres  and  diameter  is  measured  in  10  * meters.  The  following  approximate  relations  hold: 


ix} 

•X,  — A(/n,,  (z/).  X,  N 

Mm  

n, 

. P,  ~ N IX„,  , 

TABLE  1.  Parameters  for  Example  / 


V'jriuhle 

Class 

■V,. 

■V, 

y. 

tr . 

'tr 

< 1 

1 Muse) 

II 

78  40 

81  60 

1 67 

80,00 

0,50 

S5.5,85 

S55,8.5 

so  005 

2 (lu>,e> 

II 

45  80 

54,20 

1 67 

50  00 

1 40 

34  75 

34,75 

0 005 

3 ((use) 

II 

23  00 

27,00 

1 67 

25  00 

0 80 

12,00 

12,00 

__ 

0(K)5 

4 (diiimetcr) 

III 

212.30 

217,70 

1,67 

215,0(1 

1 00 

5 00 

5,00 

W,75 

J4,75 

S0.005 

0 005 

A,  - 2 

A,  - 3 

^ 1 

- 4 

/.  - 

10,000 

, 

S2.00 

C,  - 

S)00 

- 1 

Optimization  of  the  model  can  be  achieved  through  application  of  an  iterative  search  pro- 
cedure, in  this  case  the  pattern  .search.  The  purpose  of  the  search  is  to  define  values  of  the 
decision  variables  which  minimize  the  total  cost  of  quality  control.  Because  of  the  exploratory 
nature  of  search  procedures,  the  "indicated"  optimum  may  not  be  the  true  optimum.  However, 
for  those  cases  investigated  in  this  research,  the  cost  at  the  indicated  optimum  was  always 
within  0.10%  of  the  least-cost  solution  optained  by  subsequent  enumeration  about  the  "indi- 
cated" optimum  identified  by  the  search  procedure. 

Two  approaches  have  been  explored  in  applying  an  iterative  search  technique  to  optimiza- 
tion ol  the  model.  In  both  approaches  a starting  point  representing  no  sampling  and  implying 
lot  acceptance  on  all  variables  («,  — 0,  t = 1,  2,  3,  4)  was  chosen.  In  the  first  approach  the 
pattern  search  is  applied  directly  to  the  multivariate  model  with  the  starting  point  represented 
by  lot  acceptance  on  all  variables.  The  second  approach  consists  of  applying  the  model  to  each 
variable  separately,  as  though  the  remaining  variables  were  not  to  be  considered.  The  pattern 
search  is  then  applied  to  each  single-variable  model,  using  lot  acceptance  as  the  starting  point  in 
each  case.  The  least-cost  solution  for  each  variable  treated  independently  is  then  used  as  the 
starting  point  for  optimization  of  the  model  when  all  variables  are  considered.  The  latter 
approach  has  been  found  to  be  more  efficient  with  respect  to  execution  time  on  a digital  com- 
puter, and  it  is  the  approach  presented  here. 

The  results  of  the  application  of  the  pattern  search  to  the  model  when  each  variable  is 
treated  separately  is  summarized  in  Table  2.  The  components  of  the  expected  cost  of  quality 
control  C\.  Cf.  and  C,,  where  C ; - E\C,(p:)\,  C\  - £[C.,(m)1,  C«  - £[Q(m)1, 
and  C I,  — £[(  s^mI).  given  for  each  variable  are  those  costs  which  arise  when  the  variables  are 
treated  independently  and  arc  therefore  misleading.  As  already  indicated,  in  the  second  stage 
of  the  search,  the  least-cost  solution  for  sample  size  and  lower  and  upper  control  limits  given  in 
Table  2 for  each  variable  are  used  as  the  starting  point  for  the  search  for  the  optimum  when  all 
variables  are  represented  in  the  model.  The  second  stage  of  the  search  is  summarized  in  Table 
3.  The  total  expected  cost  of  quality  control  per  inspection,  Cj,  and  the  components  thereof, 
('/.  Cf.  Cf,  and  C's,  for  the  starting  point  for  the  second  stage  of  the  search  (optimum  points 
in  Table  2)  and  for  the  final  optimum  indicated  by  the  search  are  given  in  Table  3. 
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TABLE  2.  OptmuzaKon  oj  cadi  variable  trcaicj  tniJcpcinIcnily 


Variable 

Point 

n 

LCL, 

UCL, 

' ■ “ 1 

(■/ 

.... 

. - - .. 

C'« 

Cs 

C'/ 

1 

Start 

0 

— 

— 

so.oo 

56367.62 

50.00 

07)0 

6367.62 

Opt. 

30 

79.25 

80.75 

0.15 

3879.90 

1189.01 

0.00 

5069.06 

2 ^ 

Start 

0 

— 

— 

0,00 

6147.27 

0.00 

0.00 

6147.27 

Opt. 

32 

47.94 

52.06 

0.16 

3905,91 

1274.14 

0.00 

5180.21 

3 

Start 

0 

— 

0.00 

5772,03 

0.00 

0.00 

5772.03 

Opt. 

22 

23.70 

26.30 

0.1 1 

4573.38 

861.95 

0.00 

5435.44 

4 

Start 

0 

— 

— 

0.00 

1639.23 

0.00 

0.00 

1639.23 

Opt. 

36 

214.40 

215.60 

0,18 

303.00 

0.00 

1292.69 

1595.87 

TABLE  3.  Final  optimization  of  the  model 


Variable 

i 

Point 

n, 

LCL, 

UCL, 

c, 

c, 

C’re 

c. 

1 

30 

79.25 

80.75 

. 

2 

32 

47.94 

52.06 

2 

5.55 

56344.28 

53  1 46.50 

56016.18 

515,507,51 

3 

22 

23.70 

26.30 

4 

36 

214.40 

215.60 

1 

36 

19A2 

80,58 

2 

E 

3 

34 

48,34 

51.66 

E 

1.13 

4127.35 

6846.35 

3653.21 

14,628.04 

3 

28 

23.98 

26.02 

4 

o 

1 

214 

214.37 

215.63 

Existence  of  a Unique  Minimum 

In  applying  a .search  procedure  to  the  optimization  of  a mathematical  model,  one  tacitly 
assumes  that  the  model  possesses  a single  minimum  or  maximum,  as  opposed  to  several  local 
optima.  In  some  cases  this  condition  can  be  verified  analytically.  In  many  cases,  however,  the 
existence  of  a single  minium  (or  maximum)  cannot  be  proven  analytically  but  may  be  demon- 
strated by  examination  of  representative  example  ca.ses.  The  latter  approach  was  taken  in  the 
case  of  the  model  presented  here.  It  should  be  noted  that  examination  of  a representative  set 
of  example  problems  does  not  "prove"  that  the  model  possesses  a unique  minimum  (or  max- 
imum), but  it  can  indicate  that  uniqueness  of  the  optimum  can  be  expected  in  a large  number 
of  cases.  In  this  research  the  authors  were  not  able  to  identify  a case  for  which  the  model  pos 
sessed  several  local  minima. 

Two  approaches  were  taken  to  determine  whether  local  minima  existed  for  several  exam 
pie  problems  analyzed  First,  several  starting  points  were  chosen  for  application  of  the  search 
procedure.  For  each  example,  the  optimum  indicated  by  the  search  for  the  various  starting 
points  clustered  in  the  same  neighborhood  and  resulted  in  total  costs  of  quality  control  which 
were  within  0.10%  of  one  another. 


The  second  approach  to  the  analysis  of  the  behavior  of  the  function  requires  the  selection 
of  a random  point  in  the  solution  space  and  evaluation  of  the  function  on  the  line  segment 


multivariatf.  sampling  plans 


645 


between  that  point  and  the  optimum  indicated  by  the  search  procedure.  This  process  was 
repeated  for  five  ditTerenl  randomly  selected  points  for  several  examples.  Specifically,  let  A,  be 
the  maximum  deviation  from  M/,i  analyzed  for  the  lower  and  upper  control  limits,  M,  be  the 
maximum  sample  size  for  the  / th  variable,  and  the  /•),,  ri,,  and  r^,  be  random  numbers  associ- 
ated with  the  sample  size  and  the  lower  and  upper  control  limits  for  the  / th  variable,  and  n,', 
LCL',  VCL'  be  the  associated  optimum  sample  size  and  lower  and  upper  control  limits.  Let  X 
and  X * be  the  vectors  defined  by 


/ 

' n 1 

r,. 

LCLl 

Mio  - A|  rii 

Mil)  + A|  rai 

UCLl  1 

m2  rij 

«2  ! 

1 

M2I)  ~ ^2  ''22 

LCL '2  ' 

^20  + Aj  r32 

f/CLj 

X - ' 

X*  = . 

1 

1 

"a'  i 

*3  1 

j.o  ~ A*  j /■2,*  j 

LCL,.  1 

h 1 

Ma,.o  + A*  /-j * 

UCL, . 

\ 

‘3  . 

Then  a convex  combination  of  X and  X*,  AX  + (1  - A)  X*{0  < A < 1),  yields  a point  on  the 

line  segment  joining  X and  X *. 

1 

1 

For  each  example  problem  examined  the  total  cost 

of  quality  control  associated  with  * 

aX  -t-  (1  - A)  X*  proved  to  be  greater  than  at  X * indicating  that  the  search  technique  does  in 
fact  identify  a point  as  optimum  which  is  close  to  the  tfue  global  optimum.  In  addition,  this 

analysis  showed  that 

r 

(19)  C/U,  X -F  (1  - A|)  X‘)  < LVlAjX -1-  (1  - Aj)  X*1 

for  A I > A;.  Equation  (19)  implies  that  the  cost  model  probably  possesses  only  one  local 
minimum,  at  least  for  the  examples  treated. 

I 


As  a corollary  to  the  analysis  just  described,  the  authors  have  analyzed  several  examples  < ^ 

by  varying  each  of  the  decision  variables,  one  at  a time,  from  their  optimal  values  and  evaluat-  I, 

ing  the  total  cost  of  i uality  control  as  a function  of  these  deviations.  This  analysis  indicates  ! 

cost  is  most  sensitive  to  departures  of  the  control  limits  from  optimal  for  those  variables  for 
which  inspection  is  destructive  (classes  I and  II)  and  least  sensitive  to  departures  of  sample  size 
from  optimal  for  variables  for  which  inspection  is  nondestructive  (class  III).  The  variation  of 
total  expected  cost  with  variation  of  the  decision  variables  (one  at  a time)  is  illustrated  in  Fig- 
ures 2 and  3 for  the  problem  in  Example  I For  Figure  2,  the  abscissa  is  the  ratio  njn\  / “ 1, 

2,  3,  4 In  Figure  3 the  abscissa  R is  the  multiple  of  i — 1,  2,  3,  4,  such  that 

(20)  - ^,0-F 
and 

(21)  LCL,~  n.,„- 

and  )(  — 0 0375  yields  the  optimum  values  of  UCL,  and  LCL,. 
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MCHIRK  3 Toial  expetlccl  cost  nf  quality  amirol  per  lot  as  a funclion  of  devia- 
tion of  UCL  from  U('l.‘  and  IX’I.  from  1.(1.’  for  the  problem  in  Example  I 


Sensitivity  Analysis 

To  apply  the  model  presented  in  this  paper,  several  system  parameters  must  be  estimated 
and  probability  distributions  identified.  One  of  the  most  difficult  estimation  problems  is  that 
associated  with  identification  of  the  density  functio.n  of  the  lot  mean  for  each  of  the  variables 
considered,  in  general,  the  parameters  of  these  density  functions'  can  be  estimated  through  the 
collection  of  a moderate  v rlume  of  data.  However,  identification  of  the  shape  of  each  density 
function  may  require  an  extensive  and  expensive  data-collection  effort. 

The  research  of  Latimer,  Bennett,  and  Schmidt  [18]  and  Schmidt  and  Bennett  [29]  has 
indicated  that  in  many  cases  the  optimal  sampling  plan  and  the  total  cost  of  quality  control  are 
not  sensitive  to  the  shape  of  the  distribution  of  the  random  variables  which  are  subjected  to 


648 


S.  C OJAPMAN,  J W SCHMIDT  AND  C,  K BPNNKTT 


control.  However,  a method  for  analyzing  the  sensitivity  of  the  model  to  distributional  shape  is 
necessary  if  this  analysis  is  to  be  carried  out  in  specific  applications  of  the  model.  The  result  of 
such  an  analysis  will  indicate  the  extent  of  the  data-collection  effort  required.  That  is,  if  the 
model  proves  to  be  insensitive  to  distributional  shape  for  a given  application,  then  the  analyst 
will  require  only  a rough  idea  of  the  shape  of  the  density  functions  involved.  On  the  other 
hand,  if  the  model  proves  to  be  sensitive  to  distributional  shape,  then  an  extensive  data  collec- 
tion effort  may  be  in  order  to  specify  precisely  the  density  function  of  the  lot  mean  for  each 
variable  to  be  controlled. 

If  we  let 

/(m.) 

(■; 

hti,) 

C, 

then 

(22)  p = --  — (100) 

t / 

is  the  percent  loss  realized  as  a result  of  using  the  assumed  density  function  /(/u,)  instead  of 
the  actual  density  function  /(m,)-  To  illustrate  the  sensitivity  analysis  suggested  here,  consider 
the  following  example. 

EXAMPLE  2\  The  problem  considered  here  is  the  same  as  that  presented  in  Example  1. 
The  assumed  density  function  of  /z„  ./(/z,).  is  normal,  with  mean  and  variance  given  in  Table 

1,  ( = 1,  2,  3,  4;  it  is  assumed  that  the  mean  and  variance  have  been  estimated  with  reasonable 

precision.  Now  we  choose  a density  function  ,/'(/z,)  having  the  same  mean  and  variance  as 
/(m,)  but  a different  shape  for  all  /.  For  this  example  we  will  choose  distributions  having  the 
shapes  shown  in  Figure  4.  That  is,  Cj  is  evaluated  when  i = I,  2,  3,  4,  has  the  shape 

given  in  Figure  4b.  Then  C]  is  evaluated  with  this  same  set  of  density  functions,  fip.,),  / = 1, 

2,  3,  4.  The  percent  loss  realized  is  then  calculated  by  equation  (22).  The  process  is  then 
repeated  with  /(/z,,,  i = 1,  2,  3,  4,  taking  on  the  shapes  given  in  Figures  4c,  4d,  and  4e  succes- 
sively. Since  the  distributional  shapes  given  in  Figures  4b,  c,  d,  and  e represent  rather 
signficant  departures  from  the  assumed  shape  given  in  Figure  4a,  calculation  of  the  percent  loss 
in  each  case  should  indicate  the  sensitivity  of  the  model  for  the  application  given  in  Example  1 
to  significant  errors  in  identifying  the  true  shape  of  Up.,),  / ==  1,  2,  3,  4.  The  results  of  this 
analysis  are  summarized  in  Tables  4 and  5.  Table  4 presents  the  optimum  cost  of  quality  con- 
trol per  lot  when  f(p,)  is  known  for  each  of  the  distributions  shown  in  Figures  4b,  c,  d,  and  e. 
Table  5 gives  the  percent  loss  due  to  determination  of  the  optimum  sampling  plan  when  the 
density  functions  of  p,  are  assumed  to  be  normal,  / = 1,  2,  3,  4,  but  actually  assume  the 
shapes  given  in  Figures  4b,  c,  d,  and  e. 

The  results  summarized  in  Tables  4 and  5 indicate  that  errors  in  identifying  the  shape  of 
the  distribution  of  the  lot  mean  are  not  .serious,  at  least  for  this  example.  While  general  con- 
clusions cannot  be  drawn  regarding  the  sensitivity  of  the  model  and.  the  optimum  soluion  to 
distributional  shape,  these  results  are  not  surprising  in  light  of  similar  results  given  by  Schmidt 
and  Bennett  [291  and  Latimer,  Bennett,  and  Schmidt  [181.  In  any  case,  the  method  presented 
for  sensitivity  analysis  may  be  applied  to  any  specific  case  where  the  model  is  used. 


= p.d.f.  of  p, , 

= total  expected  cost  of  quality  control  per  lot  under  the  optimum  set  of 
sampling  plans  if  / (p,)  is  known, 

= assumed  p.d.f.  of  p,, 

= total  expected  cost  of  quality  control  per  lot  under  the  optimum  .set 

of  sampling  plans  r/cnvcr/ when  /(p,)  is  assumed,  but  ('vo/uutcr/ using  Zip,), 
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FlCillRI-:  4 Assumed  (a)  and  Irue  (b.c.d.e)  density  functions  used  for 
the  sensitivity  analysis  in  Kxample  2. 


TABLE  4.  Optimum  sampimf;  plans  and  expected  total  cost  oj  quality 
control  per  lot  for  the  distributions  shown  in 
Fifiures  4h,  c,  d,  and  e 


Variable 

Number 

Sample 

Size 

Control  Limits 
(Lower,  Upper) 

Total  Expected 
Cost 

1 

38 

(19.393,  20.606) 

b 

2 

38 

(48.266,  51.733) 

514,837.43 

3 

38 

(3.978,  6.021) 

4 

105 

(14.224,  15.784) 

1 

48 

(19.410,  20.591) 

c 

2 

46 

(48.303,  51.695) 

515,358.03 

3 

42 

(4.026,  5.974) 

4 

136 

(14.312,  15.694) 

1 

0 

— 

d 

2 

0 

— 

.516,061.91 

3 

0 

— 

4 

234 

(14.433,  15.554) 

1 

0 

— 

e 

. 2 

0 

— 

515,739.88 

3 

0 

— 

4 

326 

(14,298,15.700) 

TABLE  5.  Percent  loss  due  to  assuminp  that  p,,  i = /,  2.  J.  4, 
IS  normally  distrihuted  when  p,  has  the  density 
function  f(p,),  1=1,2,  3,  4,  for  each  of  the  shapes 
Miven  in  Pipures  4h,  c,  d,  and  e 


c; 

C 1 

P 

b 

514,837.43 

514,872.21 

0.23% 

c 

15,358.03 

15,402.16 

0.29% 

d 

16,0.61.91 

16,261.81 

1.24% 

e 

15,739.88 

16,316.37 

3.66% 

650 


s c CHAPMAN,  J W SCMMIOT  AND  Ci  K BKNNKTT 


BIBLIOGRAPHY 

(1)  ASQC  Standard  A3  — 1964,  "Glossary  of  General  Terms  Used  in  Quality  Control," 
(American  Society  for  Quality  Control,  1964). 

I [21  Bather,  J,  A.,  "Control  Charts  and  the  Minimization  of  Costs,"  Journal  of  the  Royal  Statist- 

' ical  Society,  Series  B,  25  (1963). 

[31  Bowker,  A.  H.,  and  H.  P.  Goode,  Sampling  Inspection  hy  Variables,  (McGraw-Hill,  New 
York,  1952), 

l4l  Dodge,  H.  F.,  and  H.  G.  Romig,  "A  Method  of  Sampling  Inspection,"  Bell  System  Techni- 

Ical  Journal  H,  613-631,  (October  1929), 

(5]  Duncan,  A.  J.,  "The  Economic  Design  of  X Charts  Used  to  Maintain  Current  Control  of  a 
Process,"  Journal  of  the  American  Statistical  Association  5/,  228-242  (June  1956). 
l6l  Duncan,  A.  J.,  Quality  Control  and  Industrial  Staiisiics,  (Richard  D.  Irwin,  Inc.,  Homewood, 
Illinois , 1965). 

17]  Duncan,  A.  J.,  "The  Economic  Design  of  X Charts  When  There  is  a Multiplicity  of  Assign- 
able Causes,"  Journal  of  the  American  Statistical  Association  66,  107-121  (March 
1971). 

[81  Ervin,  N.  E.,  "Simplified  Variables  Sampling— Graphical  Method,"  ASQC  Conference 
Transactions,  767-776  (American  Society  for  Quality  Control,  1966). 

|91  Ghare,  P,  M.,  and  P.  E.  Torgersen,  "The  Multicharacteristic  Control  Chart,"  Journal  of 
Industrial  Engineering  /9,  269-272  (June  1968),  _ 

HOl  Gibra,  Issac  N.,  "Economically  Optimal  Determination  of  the  Parameters  of  X Control 
Charts,"  Management  Science  17,  635-646  (May  1971). 

Ill)  Goel,  A.  L.,  et  al.,  "An  Algorithm  for  the  Determination  of  the  Economic  Design  of  X 
Charts  Based  on  Duncan’s  Model,"  Journal  of  the  American  Statistical  Association  63, 
304-320  (March  1968). 

il2l  Hald,  A.,  "The  Compound  Hypogeometric  Distribution  and  a System  of  Single  Sampling 
Inspection  Plans  Based  on  Prior  Distributions  and  Costs,"  Technometrics  2,  275-339 
(I960). 

(131  Hamaker,  H.  C.,  "Economic  Principles  in  Industrial  Sampling  Problems  — A General 
Introduction,"  Bulletin  of  the  International  Statistical  Institute  33,  105-122,  (1951). 

114)  Hill,  I.  D.,  "The  Economic  Incehtive  Provided  by  Sampling  Inspection,"  Applied  Statistics 
9,  69-81  (1960). 

(151  Horsnell,  G.,  "Economical  Acceptance  Sampling  Schemes,"  Journal  of  the  Royal  Statistical 
Society,  Series  A 120,  148-201  (1957). 

' (161  Hotelling,  H.,  "Multivariate  Quality  Control,"  in  Techniques  of  Siaiistical  Analysis,  C. 

Eisenhart,  et  al.,  eds.  (McGraw-Hill,  New  York  1947). 

(17)  Knappenberger,  J.  A.,  and  A.  H.  E.  Grandage,  "Minimum  Cost  Quality  Control  Tests," 
AlIE  Transactions  / (1969). 

f (18)  Latimer,  B.  A.,  G.  K.  Bennett,  a)  1 J.  W,  Schmidt,  "The  Economic  Design  of  a Dual  Pur- 

pose Multicharacterislic  Quality  Control  System"  AIIE  Transactions  5 (3)  (1973). 

(191  Lieberman,  G.  J.,  and  G.  J.  Resnikoflf,  "Sampling  Plans  for  Inspection  by  Variables,"  Jour- 
. nal  of  the  American  Statistical  Association  50,  457-516  (1955). 

[201  Montgomery,  D.  C.,  and  P.  J.  Klatt,  "Minimum  Cost  Multivariate  Quality  Control  Tests 
AIIE  Transactions  4,  I03-II0  (1972). 

(2Il  Mood,  A.  M.,  "On  the  Dependence  of  Sampling  Inspection  Plans  Based  Upon  Population 
Distributions,”  Annals  of  Mathematical  Statistics  14,  415-425  (1943). 

(221  Owen,  D.  B.,  "Summary  of  Recent  Work  on  Variables  Acceptance  Sampling  With  Em- 
phasis on  Non-Normality,"  Technometrics,  II,  631-637  (1969). 

(231  Owen,  D.  B.,  "Variables  Sampling  Plans  Based  on  the  Normal  Distribution,"  Technometrics 
' 9,417-423  (1967). 


multivariate  SAMPLING  PLANS 


651 


(24|  Owen,  I).  B.,  "One-Sided  Variables  Sampling  Plans,"  Industrial  Quality  Control  22  450-456 
(l%6). 

(25|  Palazzi,  A.,  "Statistical  Inspection  by  Variables  — Some  General  Observations,"  Quality, 
EOQC  Journal  75  6-7  (Spring  1971), 

l26l  Pt'an/agl,  J.,  "Sampling  Procedures  Based  on  Prior  Distributions  and  Costs,"  Technometrics 
5,  47-61  (1963). 

[27]  PfanzagI,  J.,  and  W,  Schuler,  "The  Efficiency  of  Sequential  Sampling  Plans  Based  on  Prior 

Distributions  and  Costs,"  Technometrics  12,  299-310  (1970). 

[28]  Quality  Cost  — Cost  Effectiveness  Technical  Committee,  ASQC,  "Quality  Costs  — What 

and  How,"  (American  Society  for  Quality  Control,  1967). 

[29]  Schmidt,  j.  W.,  and  G.  K.  Bennett,  "A  Cost  Model  for  the  Simultaneous  Control  of 

Several  Attributes,"  AllE  Transactions  4,  194-199  (1972), 

[30]  Schmidt,  J.  W.,  and  R.  E.  Taylor,  "A  Dual  Purpose  Cost  Based  Quality  Control  System," 

Technometrics  15,  151-166  (1973). 

131]  Shewhart,  W.  A.,  Economic  Control  oj  Quality  of  Manufactured  Product,  (Van  Nostrand, 
New  York,  1931). 

[321  Schilling,  E.  G.,  "Variables  Sampling  and  Military  Standard  414,"  ASQC  Conference  Tran- 
sactions, 175-188  (1970). 

[331  Shuler,  W.,  "Multistage  Sampling  Procedures  Based  on  Prior  Distributions  and  Costs," 
Annals  of  Mathematical  Statistics  38,  464-470  (1967). 

[341  Shakun,  M.  F.,  "Multivariate  Acceptance  Sampling  Procedures  for  General  Specification 
Ellipsoids,"  Journal  of  the  American  Statistical  Association  60,  905-913  (1965). 

[351  Sittig,  J.,  "The  Economic  Choice  of  Sampling  Systems  in  Acceptance  Sampling,"  Bulletin  of 
the  International  Statistical  Institute  33,  51-84  (1951). 

[361  Smith,  B.  E.,  "The  Economics  of  Sampling  Inspection,"  Industrial  Quality  Control  21  453- 
458  (1965). 

[371  Sriva.stava,  A.  B.  L.,  "Variables  Sampling  Inspection  for  Non-Normal  Samples,"  Journal  of 
Scientific  and  Engineering  Research  5 145-152,  (1961). 

138)  Stacy,  E.  W.,  et  al.,  "Determining  Economic  Sampling  Plans,"  IBM  Systems  Journal  8, 
220-244  (1969). 

[391  Weibull,  W.,  "A  Method  of  Determining  Inspection  Plans  on  an  Economic  Basis,"  Bulletin 
of  the  International  Statistical  Institute  33,  85-104  (1931). 

[40|  Weiler,  H.,  "On  the  Most  Economical  Sample  Size  for  Controlling  the  Mean  of  a Popula- 
tion," Annals  of  Mathematical  Statistics  23,  247-254  (1952). 

[411  Wetherill,  G.  B.,  Samplinf;  Inspection  and  Quality  Control,  (Methuen,  London,  1969). 

[42[  Wilde,  D.  J.,  and  C.  S,  Beightler,  Foundations  of  Optimization,  (Prentice-Hall,  Englewood 
Cliffs,  N.  J.,  1967). 


V 


MARKOV  CHAIN  WITH  TABOO  STATES  AND  RELIABILITY 


G L.  Sriwaslav 

Di'imrimenl  of  Siatislics 
Dihruxarli  Univer\ify 
l)ibriiyarli-7S6l)()4.  Inilia 

S.N.N.  Pandit 

Di'paritm'iil  of  Mailicmalic  s, 
Rff-ional  Knuim’iTiiix  COIIfno 
Waran>tul-f>l)6l)(l4,  India 


ABSTRACT 

A compulalionally  Icasihlc  matrix  method  is  presented  to  (ind  the  (trst- 
passage  prohiihilities  in  a Markov  chain  where  a set  oT  states  is  taboo  during 
transit.  This  concept  has  been  used  to  evaluate  the  reliability  ol  a system 
whose  changes  in  strength  cait  be  thought  of  as  a Markov  chain,  while  the  en- 
vironment in  which  It  IS  I'unctioning  generates  stresses  which  can  also  he  en- 
visaged as  another  Markov  chain 


I.  INTRODUCTION 

The  "strength"  X{t)  of  a system  can  be  thought  of  as  a random  process,  and  the  "stresses" 
Y(i)  acting  on  the  system  can  be  thought  of  as  another  stochastic  process.  The  system  will 
work  as  long  as  A"!/)  > XC/),  and  will  fail  at  t * defined  by 

Xit)  ^ Y(t).  I < r*  and  XU*)  < YU') 

The  reliability  problem  we  are  proposing  here  is:  Find  the  probability  distribution  of  /*, 
the  time  to  failure  of  the  system,  given  an  adequate  description  of  the  stochastic  processes  Xli) 
and  Y(f). 

The  problem  can  also  be  thought  of  as  the  time  to  first  zero-cro.ssing  of  the  difference 
process  ZU)  “ X(i)—Y(i),  ZU)  being  positive.  The  problem  considered  in  the  present  study 
is  a discrete  version  of  the  above  problem,  and  can  be  described  as  follows:  We  replace  the 
continuous  stochastic  processes  Xii)  and  Yit)  by  discrete  Markov  chains  X{r)  and  Y{r), 

r -0,1.2 with  stationary  transition  probabilities,  where  X(r)  and  Y(r)  take  values  from 

discrete  sets. 

We  assume  that,  at  each  step  r — 0,1,2 the  system  will  have  one  of  the  possible 

strength  levels  identified  by  the  numbers  1,2,3 n.  The  system  strength  Xir)  transits  from 

one  step  to  the  next  as  a Markov  chain  with  stationary  transition  probabilities  matrix  P —(Pn), 
I,  f —1,2 n. 
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Similarly,  at  each  r,  the  "environment"  impinges  a stress  Y(.r),  identified  by  the  numbers 

1,2 m,  which  is  again  a Markov  chain  with  stationary  transition  probabilities  matrix 

Q = (</«),  /,  A-  = 1,2 m. 

Let  us  define  the  "compound  state"  of  the  system  as  the  pair  ia.fj),  where  a is  the 
strength  state  and  (i  is  the  stress  state;  it  gives  the  state  of  the  system  at  any  step. 

Let  Z = [zii.l,  J,k))  = P the  Kronecker  product  of  P and  Q.  It  is  a square  matrix 
of  order  N = n y.  m and  gives  the  one-step  transition  probabilities  of  transition  from  the  com- 
pound state  (1,1)  to  the  compound  state  (J.k). 


The  compound  states  of  the  system  are  classified  into  "survival"  set  S and  "failure"  set  /■'as 
follows: 


(/,  /)  € 5 if  / > / and  (/,  /)  € f if  / < /. 


The  problem  is  to  find  the  system-survival  function,  i.e.,  the  chance  that  the  system  does 
not  enter  Fsei  in  the  first  r steps,  r = 0,1,2,  . . conditional  to  a specified  initial  state. 

The  question  essentially  is  one  of  determining  the  first-entry  probabilities  with  respect  to 
Z,  taking  the  failure  states  as  taboo  states. 


The  first-entry  problem  with  a prescribed  taboo-state  set  is,  of  course,  not  new.  For 
instance,  the  well  known  relation  [2] 


/’(/  + !) 
fitii 


Pafi 


fir^\ 

Jttti 


A -I 


Paa- 


giving  the  probability  for  the  first  entry  to  state  ft  at  the,  (r-l-l)th  step,  gives  the  answer  to  our 
problem  when  the  taboo  set  is  just  the  single  state  fi.  However,  it  is  obvious  that  the  above 
formula  cannot  be  used  when  the  taboo  set  consists  of  more  than  one  state. 


Chung  ll]  has  given  a recurrence  relation 

r~  I * 

P)  “ A U /■)-(-  ^ (p,  A U F)  (r  -p,  F) 

1—1 


where  fis  the  taboo  set  and  k ^ F. 


However,  this  relation  is  essentially  conceptual  and  is  of  no  help  at  all  in  the  actual  deter- 
mination of  failure  probabilities;  it  is  obvious  that  the  computation  of  p„it(r,  F)  by  this  method 
requires  a knowledge  of  p„n(r,  A U f)  as  well  as  of  p„iiiv,  k U F)  and  p^^ir  — p,  F)  ; thus  beg- 
ging the  question  itself.  Hence  the  need  for  a constructive  procedure  to  solve  the  problem. 

2.  FIRST-PASSAGE  PROBABILITIES  IN  THE  PRESENCE  OF  A TABOO  SET 

We  shall  now  present  a matrix  approach  to  solve  the  problem.  The  relevant  concepts  are 
introduced  first  and  then  the  procedure  is  developed.  It  is  then  illustrated  by  a simple  numeri- 
cal example.  Let  Z be  a transition  probability  matrix  of  order  A'  and  let  /■-(«,,  oj  ...,«*)  be 
the  set  consisting  of  A states,  among  N states,  which  are  taboo  in  transition. 


I 
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We  define  a deficient  matrix  Z'  relative  to  the  taboo  set  F.  Z'  is  obtained  from  Z 
= [z{i.l\J,k)]  by  replacing  all  z(i,l,j.k)  by  zeros  when  / < /or ./  < A or  both;  only  those  z(/./; 
i.k)  when  i > /and  J '^k  will  remain  nonzero  in  Z'. 

Then,  obviously,  /7„(r  + l./'),  the  probability  of  the  system  initially  at  / reaching  state  7 at 
the  (r  + Dth  step,  avoiding  visits  to  states  in  F during  the  first  r steps,  is  given  by 
P„(r  + \.  F)  = (Z')'Z. 

Now  Z' is  a nonnegative  matrix  with  each  of  its  row  totals  < 1,  since  by  assumption  P 
and  Q are  positive  matrices.  Hence  , by  an  extension  of  Frobenius’  theorem  [3],  all  charac- 
teristic roots  of  Z'are  necessarily  less  than  1 in  magnitude.  Consequently 

(2.1)  = ® 

and  (/-Z  ) is  nonsingular. 

Since  (Z  )'  gives  the  probabilities  of  r step  transitions  from  a nontaboo  state  to  a nonta- 
boo state,  avoiding  entry  into  taboo  states  during  transit,  we  have  (Z')'Z  giving  transition  pro- 
babilities of  reaching  from  a nontaboo  state  a to  any  state  /3  in  r-t-1  steps,  avoiding  taboo  states 
in  the  first  r steps. 


Let  U(S)  be  a column  vector  of  order  N having  1 in  places  corresponding  to  nontaboo 
states  S and  zero  elsewhere,  and  let  6/(F)  be  another  column  vector  of  order  A^with  1 in  places 
corresponding  to  taboo  states  Fand  zero  elsewhere.  Then  (Z')'  ' Z t/(S)  gives  the  probabili- 
ties that  starting  from  state  a.  a = 1,2,  . . ..N,  the  system  does  not  fail  in  the  first  r steps.  Simi- 
larly, Z U(F)  gives  probabilities  that  starting  from  state  a failure  occurs  for  the  first 

time  at  the  r th  step. 


As  lim  (Z')'  = 0,  the  system  must  fail  ultimately.  Hence,  the  event  of  nonfailure  in  the 

f — *00 

first  r steps  is  identical  with  the  event  of  (first)  failure  in  one  of  the  steps  r + 1,  r+2 «>. 

Thus  we  have, 

(zr-'  z u(s)  = £ (z)'-^"  z u (F) 

1-0 

Since, 


Z U(S)  = Z U(S) 

(zr  ' zv(s)  - (zrf,  (zr  z u(f), 

r-O 

i.e., 

(2.2)  (ZrU(S)  - (Z')'(/-Z')-'  Z V(F) 

We  remark  that  for  computational  purposes  we  can  delete  the  zero  rows  and  columns  of 
Z'  with  corresponding  changes  in  the  other  matrices.  We  see  that  the  deletion  of  the 
corresponding  rows  from  the  column  matrix  Z U(F),  along  with  Z\  will  not  change  the  right- 
hand  side  of  equation  (2.2).  Let  f/„(S)  be  the  abridged  form  of  LUS)  obtained  by  deleting 
zero  rows  in  U(S).  Let  Z'^  be  the  abridged  form  of  Z'.  Then  Z'^  is  nonsingular,  and  so  is 
(l-Z'J,  as  Z'„  has  all  its  characteristic  roots  less  than  1 in  magnitude.  Hence  from  equation 

(2.2)  we  have 
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<2.3)  U,{S)  - U-Z'J-' Z,U,{F) 

In  equation  (2,2)  /•  is  a random  variable  representing  the  number  of  steps  up  to  which  the 
system  does  not  fail.  Its  conditional  probability  distribution  on  condition  that  the  system  starts 

in  state  o (<»  = 1,2 N),  is  given  by  the  row  corresponding  to  state  a of  the  column  vector 

sequence 

(Z'J'Z,U„{F).  r =0,1,2 oo. 

Hence,  for  a given  r and  the  2 matrix,  conditional  reliability  of  the  system  can  be  evaluated 
from  (2.2)  on  condition  that  the  system  starts  in  state  a.  Unconditional  reliability  R{r)  of  the 
system  for  r steps  is  then  given  by  R(r)  = (Z'J'  UJS),  where  p„  is  the  row  matrix  of  order 

N giving  the  probability  distribution  of  initial  states  of  the  system,  and  p,„  is  its  abridged  form. 

The  moment  generating  function  of  ris  given  by 

Af(«)  = f (Z’^rZ,  UJF)  = (/-e*  Z;)-'  Z„  UJF). 

r~0 

The  conditional  t-ih- factorial  moment  of  rcan  be  directly  obtained  as 
M|„|  =-Flr(r-l)  ...(/•-</+  I)  /aj 
“(Z'j'fz-z;)-'-'  z,  (yjF) 

~(Z',)  (/-Z'J--  UJS).  by  (2.3). 

The  vth  unconditional  factorial  moment  is 

m'w  = P,„(Z' ) (/-z:,)-"  U,(S). 


An  Illustration 

Let  us  consider  an  example  with  strength  and  stress  transition  probabilities  matrices  P and 
Q,  respectively,  as  follows: 


0.6  0.3  O.I 
0.4  0.4  0.2 
O.I  0.6  0.3 

Then, 

Z ~ P y.  Q 

States 


0.3 

0.2 


0.7 

0,8 


and  Q - (//,*) 


1 

2 

3 

4 

5 

6 

1 

0.18 

0.09 

0.03 

0.42 

0.21 

0.07 

2 

0.12 

0.12 

0.06 

0 28 

0.28 

0.14 

- 3 

0.03 

0.18 

0.09 

0.07 

0.42 

0.21 

4 

0.12 

0.06 

0.02 

0.48 

0.24 

0.08 

5 

0.08 

0.08 

0.04 

0.32 

0.32 

0.16 

6 

0.02 

0.12 

0.06 

0.08 

0.48 

0.24 
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Clearly  states  2,  3,  and  6 are  failure  states.  Hence  the  abridged  deficient  matrix  is, 


and 


Z' 


[0.18  0.42  0.21 
|0.12  0.48  0.24 
0.08  0.32  0.32 


(/  - Z'„)-' 


0.2768  0.3528  0.2100 
0.1008  0.5408  0.2220 
0.0800  0.2960  0.3760 


- 0.167840. 


V(S)  - ( 1 0 0 1 1 0 ),  and  V(F)  = (01  1001),  where  V(S)  and  V(F)  are  tran- 
sposes of  {J{S)  and  V(F)  respectively. 


It  can  be  easily  verified  that 


(/-z;)-'  Z„  U^F) 


1 

1 

1 


where  rows  corresponding  to  the  failure  states  are  deleted. 


We  may  note  that  to  calculate  the  probability  of  success  in  the  first  r steps  it  is  not  neces- 
sary to  calculate  the  powers  of  Z^  . The  following  recurrence  relationship  can  be  used: 

Let  (ZJ'  t/„(5)  - </(r.5) 

=the  reliability  vector  at  the  rih  step, 

then  q(r  -E  1,  5)  = Z'^  qir,  S) 

“the  reliability  vector  at  the  (r-t-Df/i  step. 

Obviously  </(0,  S)  - transpose  of  (1,1,1  ). 


For  the  given  example,  for  r - 0,  1.  2.  3.  4,  we  have 


2; 

q(r,  s) 

r - 0 

r - 1 

r - 2 

/•  * 3 

r - 4 

0.18 

0.42 

0.21 

1 

0.81 

0.6490 

0.518148 

0.412641 

0.12 

0.48 

0.24 

1 

0.84 

0.6732 

0.536472 

0.427178 

0.08 

0.32 

0.32 

1 

0.72 

0.5640 

0.447885 

0.356447 

Let  the  initial-state  probabilities  of  the  system  be  given  by  p„  — (0.3,  0.1,  0.2,  0.1,  0.2, 
0.1),  and  the  abridged  form  by  p,^  — (0.3,  0.1,  0.2). 

Then  R(r),  reliability  at  the  rth  step,  is  given  by 

/?(!)-  0.471,  R{2)  - 0.375060,  RO)  - 0.298669,  /?(4)  - 0.237799. 
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The  conditional  expectation  of  r on  the  condition  that  the  system  starts  in  state  a is  given 
by 


E(r/a)  = Z'„  (/-Z;)-'  f/„(S) 


4.002383 

4.145377 

3.480458 


Hence  the  unconditional  expectation  of  r is  given  by 


Eir)  - 1.2007149  + 0.4145377  + 0.6960916  = 2.3113442. 
Similarly,  E(r^)  = 20.3167038. 

Hence  V(r)  ~ 14.9743918, 
and  S.A(r)  - 3.8696759. 
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ABSTRACT 

I 

This  paper  examines  righl-hand  side  sensilivily  analysis  in  linear  program-  ! 

mmg  as  a problem  in  optimal  sampling  Specifically,  the  insensitivity  point  of  a I 

solution  IS  defined  as  the  point  at  which  the  expected  gain  from  increased  accu-  I 

racy  in  the  prediction  of  a resource  fevel  is  equaf  to  the  expected  cost  of  procur- 
ing the  information  The  problem  is  structured  using  the  rudiments  of  optimal 
statistical  decision  theory 

INTRODUCTION 

This  paper  is  concerned  with  sensitivity  analysis  in  linear  programs.  Usually,  when  we  1 

have  considered  a .solution’s  sensitivity,  we  have  found  (for  example)  the  range  that  the  right- 

hand  sid  could  vary  before  a change  in  basis  was  required.  The  implicit  motivation  was  that  if  , 

the  right  Hand  side  changed,  and  the  range  of  values  over  which  the  optimal  basis  was  applica-  * 

ble  was  in  some  sense  "small,"  then  we  would  be  operating  wi.th  a solution  to  the  wrong  prob- 
lem. In  this  paper  we  will  attempt  to  formalize  this  notion. 

We  will  argue  that  the  real  problem  is  to  find  the  value  for  the  right-hand  side  that  best 
trades  off  potential  lo.sses  (for  using  an  incorrect  value)  with  the  cost  of  obtaining  the  improved 
solution.  We  will  limit  ourselves  to  changing  only  one  element  of  the  right-hand  side,  which 
will  be  a .sample  mean  instead  of  the  unknown,  true  value. 

Two  problems  will  be  discussed.  In  the  first  we  assume  some  data  has  been  used  to  form 
an  estimate  and  we  formulate  the  function  to  be  minimized  that  indicates  how  much  more  data  ‘ (, 

should  be  gathered  so  as  to  "desensitize"  the  optimal  solution.  We  will  then  examine  the  prob-  f 

lem  ol  finding  the  overall  sample  size  in  order  to  produce  an  insensitive  solution.  : 

THE  BASIC  PROBLEM 

Consider  the  following  linear  program: 

(LP)  max  px 


subject  to  Ax  ^ h, 
X ^ 0, 
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where  p and  x are  k-veclors,  6 is  a nonnegalive  /n  vector  and  /f  is  a m x k matrix.  We  will 
assume  that  p and  A are  perfectly  known,  as  are  all  elements  of  h except  for  h„  which  will  have 
to  be  estimated.  Finally,  we  assume  that  ti  data  points  are  used  to  construct  an  unbiased  esti- 
mate of  A,,  which  we  shall  symbolize  as  h,{n).  While  we  have  m mind  the  sample  mean  as  the 
estimator,  it  will  be  seen  that  other  estimators  (such  as  regression  coeflicienis)  could  also  be 
used.  However,  for  simplicity  we  take 


n 


Jn 


: exp 

77^7 


where  6,  is  the  true  value  of  the  amount  of  the  resource  on  hand._  Let  the  optimal  solution 
to  (l.P)  when  b,(n)  is  usedt  for  h,  be  .v  with  as.sociated  basis  matrix  k Further  let  the  optimal 
solution  to  (LP)  when  the  true  value  of  the  /'*’  resource  (namely  h,)  is  used  be  x ‘with  associ- 
ated basis  matrix  B *. 


In  general,  our  intuitive  feeling  is  that  a solution  is  sensitive  to  the  right-hand  side  value 
if  "small"  changes  in  the  value  might  require  a change  of  basis.  We  will  assume  that  the 
changes  in  the  value  of  6,(«)  came  about  from  changes  in  the  data  used  to  estimate  h,  as  n is 
changed. 

Figure  1 illustrates  the  problem  to  be  discussed.  The  solid  lines  represent  constraints 
while  the  dashed  line  represents  the  objective  function.  Two  bell-shaped  curves  are  drawn, 
centered  about  the  true  value  of  the  right-hand  side  of  the  third  constraint.  If  this  value  were 
known  then  the  optimal  solution  would  occur  at  point  A.  Let  the  curve  labelled  1 represent  the 
density  function  of  the  sample  mean  given  that  n data  points  have  been  used  to  form  the  sam- 
ple. For  this  right-hand  side  the  solution  is  sensitive  in  the  sense  that  the  sample  mean  might 
easily  place  the  constraint  considerably  to  the  left  or  right  of  the  true  location,  e.g.,  it  might  end 
up  at  B or  C. 

There  are  basically  two  approaches  to  the  problem.  One  method  would  be  essentially  to 
chance-constrain  the  program.  This  would  include  increasing  the  sample  size  until  the 
estimated  variance  of  the  sample  mean  were  in  some  sense  "small",  i.e.,  the  probability  ot  a 
sample  mean  occurring  outside  of  some  interval  about  the  true  value  would  be  less  than  or 
equal  to  «.  This  is  represented  in  Figure  1 by  the  curve  labelled  2.  This  requires  the  setting  of 
a at  some  arbitrary  level  and  thus  really  does  not  clarify  the  .sensitivity  problem  at  all.  Now  the 
solution  is  sensitive  to  «,  which  was  arbitrary. 

A second  approach  is  available  if  the  cost  of  gathering  data  is  expressible  in  the  same 
units  as  the  objective  function,  e g.,  dollars.  We  will  examine  this  approach  in  detail  in  the  rest 
of  the  paper.  We  will  assume  that  a data  cost  (unction  can  be  obtained  and  that  it  (aithlully 
reflects  the  costs  of  acquiring,  processing,  storing  and  providing  sample-mean  estimates.  We 
assume  that  costs  are  a function  of  the  amount  of  data  only  and  thus  lAn)  is  the  cost  for  pro- 
viding 6,  (/;). 

In  what  follows  we  will  develop  loss  functions  and  state  the  sensitivity  problem  as  finding 
the  sample  size  that  minimizes  expected  loss  plus  cost.  Thus,  rather  than  .setting  an  arbitrary 
parameter,  we  will  consider  a solution  to  be  in.sensitive  when  the  marginal  expected  gain  from 
extra  information  is  just  balanced  by  its  marginal  cost. 


I 


t We  refer  lo  ihe  whole  rixhi-hand  side  in  this  ease  as  hin) 


l.INKAR  PROGRAM  RIGMT-HANI)  SlOP 
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Consider  a decision  maker  who  wishes  to  solve  (LP)  with  h(n)  as  the  right-hand  side. 
Me  may  choose  to  produce  the  activities  associated  with  Bon  the  assumption  that  h(n)  is  avail- 
able. In  this  case  he  will  either  not  use  6 — 6(/;)  resources  or  he  will  run  out  early  (when 
h < bln)). 


Let  h be  the  true  vector  of  resource  levels  with  element  h,  and  h be  the  vector  with 
^1  * f’l  i ^ / and  h,  = h,ln).  We  define  the  loss  associated  with  using  6 instead  of  6 as  Hh.h) 
and  will  take  it  to  be: 


lAb.h) 


(max  p.x\Ax  ^ b,  .x  ^ 0) 

- (max  px\Ax  < 6.  x > 0),  if  6 > 6, 

(max  px\Ax  < 6,  X > 0) 

- (max  pxl-4x  < b.  .x^  ^ 0,  x^  = Ol.  if  6 < b. 


where  Ixf,  ^ 0,  “ 0)  means  that  only  those  vectors  represented  by  5 are  free  to  be  nonne- 

gative.  I he  loss  I unction  indicates  that  when  b b the  ex  ante  loss  will  be  taken  to  be  the 
difference  between  the  profit  that  would  have  been  obtained  if  it  were  known  that  the  resource 
were  actually  b and  the  prolij  that  is  expected  to  be  obtained  by  using  the  program  associated 
with  H and  resource  level  b.  I he  left  over  resources  will  be  considered  waste  and  will  be 
charged  to  the  solution  as  loss.  II  6 < 6 then  we  have  overestimated  the  amount  of  ’■esource. 
Ihe  <’v  ante  loss  that  will  be  incurred  is  the  difference  in  profit  between  what  would  have  been 
obtained  if  the  resource  had  been_ properly  estimated  and  what  is  expected  to  be  obtained  given 
that  we  will  use  the  basis  matrix  but  have  only  6 in  actual  resources. 


Let  77  be  the  vector  ol  simplex  multipliers  associated  with  the  optimal  feasible  basis  (■). 
Then  the  loss  function  can  be  written  as: 

b ^ b. 
b 4,  b. 

This  may  be  further  rewritten  as 


lAb.b) 


JT„.b  - 77^6, 
77H./7  - 77^6. 
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(77-«.  - Trj,)b  + (6  - h)TTi,.  b ^ h, 

{iTg-  — TTg)h,  b ^ h. 

Thus,  the  loss  function  can  be  viewed  as  being  composed  of  the  loss  due  to  using  the  wrong 
basis  plus  the  loss  due  to  overestimation  of  resource  availability. 

We  of  course  do  not  know  b.  We  therefore  assurn^e  that  the  decision  maker  or  analyst  has 
some  prior  information  about  /),,  namely  that  b,  — N(b,cr^).  Thus,  the  posterior  density  func- 
tion on  b,  given  6,,  is: 

f[b,\b,{n)]  = N([nb,(n)(T^  + ba^  / (nif^  + <t^) + rr^)). 

With  the  above  in  mind  we  are  m a position  to  make  the  general  statement  in  the  intro- 
duction more  concrete.  We  will  consider  a solution  to  (LP)  to  be  insensitive  relative  to 
bi(n  + s)  if  the  expected  loss  associated  with  6,  («  -F  .s)  plus  cost  of  sampling  s data  points  cfs) 
is  minimal.  Thus,  we  wish  to  choose  .s  such  that  we  minimize  loss  plus  cost,  i.e., 

min  f Lib.bin  + s))  / l6,  |6,  (n  -f  s)lt/6  + c (s). 

(3) 

The  solution  to  (3),  s*,  would  be  the  amount  of  data  to  add  as  to  make  the  optimal  solu- 
tion to  (LP)  insensitive.  However,  in  that  we  do  not  know  how  -F  s)  will  change  with  s, 
we  must  use  our  best  estimate  of  the  mean,  namely  the  mean  of  the  posterior  E[b\b(n)]. 
Thus  we  will  find  s*  so  as  to  minimize 

(4)  J LiEEibl^n)])  ./,  [b,\b,{n}]cJb  -F  c(.s), 
where 

/,  [b,\b,(n)]  = N{E\b,\b,(n)],  cr^  / [(n  -F  s)  -F  tr"]) 

Minimizing  (4)  is  a heuristic  approach  to  solving  (3).  In  (4)  we  have  used  the  best  esti- 
mate of  the  mean  and  are  only  altering  the  variance  of  me  posterior  density  function.  Using  a 
parametric  programming  routine  on  the  linear  program  optirr.ization  package  will  yield  the  sim- 
plex multipliers  for  all  values  of  b,.  Thus,  as  is  typical  in  tnese  formulations  (see  [2l),  one 
would  compute  the  value  of  (4)  for  different  values  of  s and  r.elect  that  s that  yields  at  least  a 
local  minimum. 

A MORE  GENERAL  PROBLEM 

The  above  procedure  raises  a significant  point,  namely  the  laci  that  it  assumes  that  n data 
points  have  already  been  sampled  and  used  to  form  h,(n).  We  would  in  fact  like  to  find  the 
total  sample  size  in  a one-shot  approach,  thereby  avoiding  the  possibility  ol  having  oversampled 
to  begin  with. 

We  will  assume  here  that  both  parameters  of  the  independent  normal  process  (i.e.,  b,  and 
tr})  are  unknown  and  must  be  treated  as  random  variables.  A sample  ol  n data  points 
(/),! b,„)  will  be  taken,  from  which  the  following  statistics  could  be  computed: 


(1) 


L{b.b)  = 


I 


i 


• - 
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V = 5^  lb„  - mp 

p = n — I 

Thus,  we  wish  to  find  n that  again  minimizes  expected  loss  plus  cost  c{n).  To  formulate  the 
expected  loss  we  assume  that  the  prior  distribution  of  (6,,l/o-,^)  is  normal-gamma  (see  the 
appendix)  where  h,  = \/<t}  and  the  parameter  of  the  prior  is  (w’,_v',  n’,  i>').  Thus  one  might 
use  previous  estimates  of  b,  to  form  im',  v'.  n\  p')  or  use  m'  = 6,  v'  = As  mentioned  in 

the  appendix,  the  posterior  distribution  will  also  be  normal-gamma  with  parameters 
(w  v",  n".  p");  the  unprimed  parameters  reflect  the  sampling-distribution  values.  Since  we 
are  most  interested  in  the  posterior  distributions  associated  with  6,  we  will  use  the  marginal  dis- 
tribution of  (b.blm",  v",  n",  p")  which  (see  appendix)  is  the  Student’s  distribution 
g(bim",  v",  n",  p").  Therefore  the  expected  loss  is 

Lib,  m ")  g(b\m  ",  v",  n" , p")D(m.  v|w',  v',  n',  p'\  n,  p) dmd\db, 

where  D(  ) is  the  sampling  distribution  (see  appendix)  and  the  updating  equalities  above  are 
used.  Finally,  this  means  that  the  optimal  sample  size  /*  is  the  number  of  data  points  that 
solves: 


min  El  (n)  -I-  c(n) 

n 

Evaluating  £/  («),  while  tedious,  will  not  be  difficult,  especially  since  Lib.  m")  is  piecewise- 
linear. 

OTHER  ISSUES 

First,  it  should  be  obvious  that  a similar  aproach  can  be  taken,  via  duality,  for  changes  in 
one  element  of  the  price  vector  p.  More  important,  however,  is  the  problem  of  restricting  the 
analysis  to  one  element  of  the  parameter  set,  be  it  b or  p.  One  would  normally  expect  to  see  a 
sensitivity  discussion  which  considered  changes  in  the  entire  vector,  not  just  one  element. 

A moment’s  reflection  will  bring  to  light  why  this  has  not  been  pursued.  In  standard  sen- 
sitivity analysis  the  right-hand  side  is  parameterized  on  a scalar  and  ranging  is  performed  by 
varying  the  scalar.  This  amounts  to  requiring  the  right-hand  side  to  expand  along  a ray,  which 
does  not  make  sense  in  the  present  application.  On  the  other  hand,  while  the  loss  function  for 
such  a procedure  is  well  understood  and  easy  to  compute  for  various  values  (this  being  a direct 
extension  of  the  material  in  the  previous  sections),  the  loss  function  to  be  used  for  the  problem 
at  hand  (where  b would  not  expand  along  a ray)  is  not  known.  In  fact,  we  would  not  have  the 
benefit  of  not  having  to  solve  the  program  for  each  6,  but  would  essentially  have  to  solve  an 
infinite  number  of  programs  in  order  to  find  the  optimal  b. 

The  obvious  heuristic  is  to  vary  one  element  at  a time.  This  is  clearly  not  optimal.  It 
should  be  clear  that  each  element  of  the  right-hand  side  would  be  a function  of  all  other  ele- 
ments and  that,  for  example,  the  "optimal"  b would  be  sequence  dependent,  i.e.,  would  change 
depending  on  the  sequence  of  6,  examined. 

SUMMARY 

This  paper  is  an  attempt  to  apply  basic  decision  theory  to  sensitivity  questions  in  linear 
programming.  We  have  taken  the  position  that  a solution  is  sensitive  only  if  the  expected  gain 


664 


A.  F.  DAUOHETY 


I 

I 

s 


from  improving  the  solution  exceeds  the  cost  of  improvement.  This  would  seem  to  be  an  obvi- 
ous and  intuitive  approach,  but  it  does  not  seem  to  be  the  approach  commonly  used  in  sensi- 
tivity analysis.t 

While  closed  form  solutions  do  not  appear  possible,  locally  optimal  solutions  are  clearly 
possible  if  the  user  has  the  facility  to  do  basic  sensitivity  analysis  and  has  access  to  (or  will 
acquire)  a computer  routine  for  numerical  integration. 
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APPENDIX 


The  following  is  shown  in  (2i: 

I.  If  >7  is  distributed  Gamma-2  then  we  mean  that 


l'^2(h\\,  v)  = e 


wh 


T-' 


wjl 


Thus,  if  (p..  It)  is  distributed  Normal-gamma  then  ./vy  (fjc.  Ii\m,  v,  n,  i>)  = /v  (p-l'n, 
hn)f^j  (ft\v.  e). 


2.  If  the  prior  distribution  of  (p,  h)  is  Normal-gamma  with  parameter  ( m' , v',  n',  e') 
and  if  a sample  then  yields  a sufficient  statistic  (in,  v,  n,  v),  the  posterior  distribution  of  (p.  h) 
is  Normal-gamma  with  parameters  (m",  v".  n",  v"): 

m"  = (n'm’  + nm)l(n'  -F  n), 

n"  — n'  + n. 

v"  = v'  + 1/  + f>(n')  -F  fiin)  — Sin"), 

v"  = ((p'v'  -F  n'(m')M  + [r-v  + nin^]  — n"  (tn")^)/v". 


where 


8(.v) 


!0  if.v=0, 

ll  if  X > 0. 


) 


I 


' k.- 


FA  rcceni  excx’i’lion  is  1 1 1 


I 


LINEAR  PROGRAM  RIGHT-HAND  SIDE 


3.  If  the  joint  distribution  of  i/x.  h)  is  Normal-gamma  as  defined  above,  then  the  margi- 
nal distribution  of  fj.  is  the  Student  distribution 


V,  tl.  (/)  = 


2 


I'  ^ (fi  — m)^ 
V 


V! 


4.  If  a sample  of  size  n > 0 is  to  be  taken  from  an  independent  Normal  process  with 
parameter  (/x,  h)  having  a Normal-gamma  distribution  with  parameter  (in',  v'.  n',  i>'),  with 
(v',  n',  ir'  > 0),  then  the  unconditional  Joint  distribution  of  the  statistic  (m.  v)  is 

Dim,  v|w',  v',  n,  u)  = Ain) 

(p'v'  -I-  j/v  -t-  n„  [in  — in']^)‘’ 

where 

III,  = n'li/in  -I-  n'), 

1'  " = i>  ' + V + \ , 


A ill)  = 


r(»^72) 
■ '>  - ' 
2 


r-r- 

n - 1 

n + n’ 

2 

i 

"V  27r  Viu'/ 


r(.v)  = gamma  function  of  x. 
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TWO  QUEUEING  SYSTEMS  SHARING  THE  SAME  FINITE  WAITING  ROOM 

M.  J,  Fischer 

Defense  Communications  Engineering  Center 
Reston,  Virginia 

ABSTRACT 

In  this  paper,  we  consider  the  analysis  of  two  M/G/1  queueing  sys- 
tems sharing  the  same  finite  waiting  room.  An  exact  analysis  is  given  for 
several  special  cases,  and  then  an  algorithm  is  developed  which  approximates 
the  system  behavior  for  the  general  problem.  Comparisons  are  made  between 
the  special  cases  and  the  algorithm. 

1.  INTRODUCTION 

In  this  paper  we  consider  the  case  in  which  two  classes  of  traffic  arrive  at  the  same  finite 
waiting  room  (K  spaces).  There  are  two  . servers,  one  for  each  type  of  customer.  A server  is 
allowed  to  service  one  and  only  one  class  of  customer.  We  assume  that  the  customers  arrive  in 
accordance  with  two  independent  Poisson  processes  with  parameters  i — 1,2.  The  random 
variables  5;,  representing  the  time  to  service  a class  / customer,  are  assumed  to  have  a general 
distribution  function,  which  is  independent  of  each  other  and  of  the  arrival  processes.  If  the 
server  for  a given  class  of  customer  is  busy  upon  the  customer’s  arrival  and  all  the  K spaces  are 
full,  the  customer  leaves  the  system  without  receiving  service.  We  will  be  considering  the  sys- 
tem in  the  steady  state. 

The  case  of  only  one  class  of  customers  has  been  considered  previously  13,4,6]  and 
recently  there  has  been  some  additional  interest  in  the  single-server  case  [5,7].  However,  to 
the  best  of  our  knowledge  no  work  has  been  done  on  the  case  where  there  are  two  classes  of 
customers  and  a server  for  each  class. 

Such  a system  could  serve  as  a model  for  the  buffer  in  a data  telecommunications  system. 

In  designing  a data  telecommunications  network,  the  buffers  (waiting  rooms)  are  usually  sized  i 

for  a negligible  probability  of  overflow  [8].  The  effect  of  buffer  size  (where  the  entire  buffer  ^ 

contents  are  served  by  a single  server)  has  been  considered  [1,2].  However,  in  a real  system  a ? 

buffer  may  store  messages  (customers)  for  several  nodes  (servers).  Thus,  the  buffer  is  a f" 

shared  resource  for  messages  destined  for  these  nodes  and  the  service  each  type  of  message  ! 

receives  depends  on  the  level  of  traffic  of  the  other  messages  using  the  buffer. 

For  the  system  considered  in  this  paper,  the  analysis  of  the  general-service-distribution 
case  is  complicated  and  not  amenable  to  a straightforward  analysis.  In  section  2,  we  give  an 
analysis  for  some  special  cases.  An  algorithm  which  approximates  the  steady-state  behavior  of 
the  system  with  general  service-time  distributions  is  presented  in  section  3.  Comparisons  with 
the  results  obtained  in  section  2 are  given  at  the  end  of  section  3 and  in  the  numerical  exam- 
ples considered  in  section  4. 
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2.  SOME  SPECIAL  CASES 

Three  special  cases  are  considered  in  this  section.  Let  Q,.  i = 1,  2,  be  the  steady-state 
number  of  class  / customers  in  the  system  (in  service  and  waiting),  and  define 
/>  ^ = Pr(0|  = i,  Qj  = 7).  We  note  when  / = 0,  1,  then  P, , = 0 for  J^K  -t-  2,  and  when 
2 < / < A -t-1,  then  P, , = 0 for  / ^ A -I-  3 - /.  Within  a class,  service  is  rendered  on  a 
first-come,  first-served  basis. 

a.  A = 0.  For  the  case  where  there  are  no  waiting  spaces  (K  = 0)  each  class  of  custo- 
mer behaves  independently  of  the  other  and  as  a single-server  loss  system.  Thus,  if 
p,  = X,  E{S,\  a = 1,  2),  then,  for  / « 0,  1 and  / = 0,  1,  we  have 

(,)  P = . 

(1  -bp,)  (1  -Epj) 

b.  Exponential  Service  for  Both  Classes.  For  the  case  where  each  class  has  an  exponential 
service  distribution,  with  £"{5,1  = 1/m„  i = 1,  2,  the  steady-state  equations  for  P , , are 

(2)  (Xi-t-X2)Fo.o“f‘i^i.o  + M2£’o.  i> 


(X,  + Xj  -(-  til)  Pn,i  = Ml  /'l,  + ^2  ^0.;-l  + M2  ^ 0./  + 1-  I < y < A, 


(p,  + Pj)  Pi.K+2-,  “ ^1  £’,-I.A+2-,  + ^2  /’/.A  + I-m 
(X2  + P|)  P A + I.O  ” X,  P 0 -b  P2  Fa+I.I> 

(m  I + M2)  )^Afi.  I ” ^ I X2  Pa  +1. 1- 
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The  solution  to  these  equations  is  (p,  = \ Jtx) 

^-./=^OOPl'P2' 

where 


P 0.0  — 


-P|)(l  -P2)(P2-P|) 


(1  -Pl*^^)(P2-P|)-P2(l-Pl)(p2^^^-pA^^+p,P2(l-p,)(l-pj)(pf^l-p,Af  + l) 


From  equation  (3)  the  marginal  probability  distributions  of  (?,  and  are  obtained 
standard  manner.  We  present  results  for  0,  (the  analogous  results  can  be  obtained  for  O 
symmetry): 


in  a 
2 via 


(4) 


PrlO,  = /)  = 


o.oPi" 


1 

' 1 - P2 

1 -p-^^J-' 


I - 


P2 


/ = 0 . 

/ = 1,  2 K +1  . 


From  (4)  we  get  the  expected  number  of  class-1  customers  in  the  system  £|G>|)  to  be 


(5)  £lC>il  = 


F 0.0 

- P2 


Pi 


(I  - - (A(  -E  2)  (1  - p,)pA^' 


(1  -P,)' 


-Pl  P2 


-PI*  ^^)  -{K  +2)  p,*^'  ip,  - P,) 


(P2  - P|)^ 


Another  measure  of  performance  for  class  I customers  is  the  probability  of  blocking  upon 
arrival,  PB^  \ this  quantity  is  given  by 


A + l 


(6)  PB^  X F,,x+2-, 


= Pn 


Pl  ^ (P2  ~ P|)  + Pi  P2  (P2*'  - Pi*  ^') 


(P2  - P|) 


It  IS  interesting  to  note  that  the  probability  of  blocking  for  class  2 customers  is  not  the  same  as 
Pfl,,  even  though  their  arrival  processes  are  Poisson.  This  stems  from  the  fact  that  the  states 
of  the  system  under  which  customers  are  blocked  are  different  for  different  classes  of  traffic.  A 
final  measure  of  performance  for  the  class  I customer,  the  expected  waiting  time  E[W.\  is 
given  by  Little’s  Formula 


(7) 


E[W^]  = £(C>,)/(1  - PS,) 


I 

r 

' I, 


c.  K ^ I,  Class  I Service  Exponential.  Class  2 Service  Erlang  2.  As  mentioned  in  section 
I,  the  analysis  of  the  case  of  general  service  times  is  not  straightforward.  Using  standard 
methods  of  analyzing  such  systems,  i.e.,  supplementary  variables  or  embedded  Markov  Chain, 
does  not  yield  results  directly.  It  does  appear,  however,  that  the  case  of  Erlangian  or  hyperex- 
ponential service-time  distributions  can  be  handled  using  the  method  of  phases  [9).  We  will 


670 


M J FISCHER 


demonstrate  how  one  could  go  about  the  solution  of  this  system  for  the  special  case  of  K = 1, 
exponential  service  distribution  for  class  1,  E{Si\  = I/mw  and  an  Erlangian  service  distribution 
with  shape  parameter  equal  to  2 (£(52)  = 2//X2)  for  class  2.  Thus,  every  class  2 customer  who 
is  accepted  into  the  system  brings  in  two  phases. 

Let  R,i  represent  the  probability  of  Qi  = /,  and  let  the  number  of  class  2 phases  in  the 
system  equal  j.  The  steady-state  equations  for  the  nonzero  values  of  /?„  are 

(a I -F  X2)  ■^00  “ Ml  £ 10  + £01  " 

(X|  -F  X2  -F  ^l2)  Roi  ~ Ml  £|0  + M2  7?02  • 

(8)  (X]  -F  X2  ^02  “ Ml  £ 12  + ^2  £00  + M2  £03  ■ 

(X1-FM2)  ^03  “Ml  7f|3‘t"^2£oi  +M2  £04  ' 

(Xi  -F  fl2)  ^04  “ Ml  £ 14  + ^2  £02  ' 


(X1-FX2  + M1)  £io“Mi£20  + M2£u+^i7?00’ 

(X  I -F  X2  + Ml  +M2)  ^ll“^l  £o1  + M|  £21  + M2^I2- 

(9)  (^l  + X2+M1  + M2)  ^I2“^l  ^02  + MI  £22"*'^2'^I0  + M2^I3- 

^Mi  + M2)  ^13“^1  ^03  + ^2^11  +M2  £ 14  - 
(Mi  + M2)  £ 14  “ ^1  £04+  ^2  £ 12  ' 

and 


(X2  + M 1)  ^20=  ^ \ R 10+  M2  £ 21  > 
(MI  + M2)  )^2I  “^1  ^11  + M2  £22  ' 
(M1  + M2)  7?22“^i£i2"^^2£22- 

Now  let  T),  = X/mi  ('  “ li  2),  a = m.2/mi>  and 


((oo 

((  10 

R 20 

£oi 

((ll 

Rii 

((02 

. «1  = 

«12 

, and  R 2 = 

Rii 

£o3 

Rm 

0 

£o4 

R 14 

0 

From  equation  (8)  we  get  the  following  matrix  equation; 


(ID 

where 


R I — A I Ro 


T),-FaT)2 

—a 

0 

0 

0 

0 

39i+«392+“ 

—a 

0 

0 

-“392 

0 

3}|+“3l2+« 

—a 

0 

0 

-ari2 

0 

Vl+a 

—a 

1 0 

0 

-“392 

0 

39l+“ 
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and  from  the  first  three  equations  in  (9)  and  equation  (11) 
(12)  R 2 ^ A 2 R \ — B2  Rq 

=•  (A  2 A f Rq  < 

where 


and 


T),-Fai72-Fl 

—a 

0 

0 

0 

0 

7Ji-F«tj2-F1  -Fa 

-a 

0 

0 

-ar)2 

0 

7)i-Fa7)2-Fl-Fa 

—a 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

7)1  0 

0 

0 

0 

0 V\ 

0 

0 

0 

. B2 

= 

0 0 

V) 

0 

0 

0 0 

0 

0 

0 

0 0 

0 

0 

0 

From  equations  (11)  and  (12)  one  can  see  that  the  solution  rests  on  determining  the  five 
unknowns,  Rq,,  j = 0,  1,  2,  3,  4.  Thus  we  need  to  generate  five  independent  equations  in 
these  unknowns.  From  (8),  (9),  and  (10)  there  are  still  five  equations  (the  last  two  from  (9) 
and  the  three  from  equation  (10)),  but  one  of  them  is  redundant.  Dropping  the  fourth  equa- 
tion of  (9)  gives  us  four  independent  equations;  the  final  one  can  be  found  from  the  normaliz- 
ing condition  ^ R,)  =1.  Denote  C,  = /t,,  and  C2  = A2  Aj  — B2  and  C,  (/,,/)  as  the  /,7  th  entry 


U 

of  C,;  then  from  equations  (Hi  and  (12)  for  7 =0,  1,  ...  ,4  we  have 


(13) 


KU  = I C,  (7»  . / = 


1,2. 


The  required  five  equations  now  become 
4 

^ 1(1  -F  a)  Cl  (4,n)  -a  1)2  C|  (2,n)]  Ron  -tj,  /?o4  = 0 , 

fl— 0 

^ l(aT)2  + 1)  Cj  (0,n)  - aC2  (1,«)  - tJ]  C|  (0,n)l  Rq„  = Q , 

n~0 

(14)  £ [(1  -F  a)  C2  (1,/)  - aC2  (2,/)  - 7,1  C,  (1,/)]  /?o„  =0, 

n“0 


£ ((1  + a)  C2  {2,n)  - r)2aC2  (0,n)  - 7,1  C,  (2./)]  Ron  “ 0 
/»-0 


4 

I 

n^O 


1 + t i C,  U.n) 

i-I  /-O 


^On  =1 


Thus,  the  solution  to  this  special  case  rests  on  solving  the  five  independent  equations 
given  by  (14).  It  appears  that  one  could  generalize  the  method  just  described  for  solution  of 
the  case  of  Erlanjgian  service  distribution  and  general  K.  But  use  of  this  method  would  require 
solution  of  a large  system  of  equations.  In  the  next  section  we  present  an  algorithm  which  can 
be  used  to  approximate  quickly  and  efficiently  the  steady  state  performance  for  general  service- 
time distributions. 
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3.  AN  ALGORITHM  APPROXIMATING  THE  GENERAL 
SERVICE  TIME  DISTRIBUTION  CASE 

The  algorithm  approximating  the  case  of  general  service-time  distributions  is  based  on  the 
following  observation:  When  there  are  ./(  > 1)  class  2 customers  in  the  system  until  the  next 
class  2 arrival  or  departure,  whichever  is  first,  the  system  behaves  like  an  M/G/l/K -I- 1-j  sys- 
tem for  the  class  1 customers.  Analogously,  the  same  sort  of  phenomenon  happens  with 
respect  to  the  class  2 customers. 

Now,  from  the  definition  of  conditional  probability,  we  have 
(15)  Pr(C>i  = /|(?2  = ./1  Prl(?2  = 7l  = Pr((?i  = '■.  C^2  = 7l 


The  algorithm  uses  equations  (17)  and  (18)  to  generate  an  approximation  to 
Pf\Q\  = i.  02  = ./(-  However,  the  conditional  probabilities  Pr{^i  = /|^2  “ 

Pri(?2  = 7lCi  “ ' 1 afe  "o*  known.  As  stated  earlier,  when  Q2  = j the  system  instantaneously 
behaves  as  an  M/G/l/K -I-  1-j  system  for  the  class  1 customers,  and  so  we  use  known  results  for 
these  systems  to  approximate  Pr{Qi  = /|02  “ ./)  [3]  or  [4]).  That  is,  when  Q2  = ./we  use 

the  state  probabilities  obtained  from  an  M/G/l/K-l- 1-J  system  where  only  class  1 customers  are 
arriving.  We  note  when  j = 0 and  1 the  conditional  probabilities  are  obtained  from  the  state 
probabilities  for  an  M/G/l/K  queueing  system.  Analogously,  PrlC2“  ./l0i  “ '1  approxi- 
mated using  the  class  2 results  from  an  M/G/l/K-f  I-i  system.  Let  us  denote  these  approxima- 
tions by  _ , and  _ ,,  and  all  other  approximate  probabilities  by  the  use  of  a circumflex 

('). 

The  algorithm  first  approximates  Pr(C/|  = /,  Q2  = ./I  by  conditioning  on  the  number  of 
class  2 customers  in  the  system.  Since  class  2 results  are  being  used,  we  denote  this  approxi- 
mation by  it  is  given  by 

.(19)  A.?’ “ Alo,.;  Pr  102  = ./). 
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where 

(20) 


Prl(^2  = /) 


V'  ~ ' 

S ^,1(2,., 


For  7=0  and  J = 1 we  use  equation  (20)  to  determine  Pr{02  = /).  Once  these  quantities 
have  been  obtained,  P,*/’  (for  ; = 0,  1,2.  ....  /£"  + 1 and  7 = 0, 1)  can  be  found  from  equa- 
tion (19).  When  - 2,  + and  F2|o,-a+i  "o*  make  sense,  but  from  the  definition 

of  conditional  probability  we  have 


^A  + i|y;-2  Pr|0|  = a:  -El) 

^2|y,-A>i  ~ Pr((?2  = 2l 


Using  equation  (20)  we  now  get 

(22)  Pr(02  = 2l  = 


1 - Pr(g|  - + 1) 


Since  for  / = 0 and  / = 1 we  have  generated  PZ  + i ,)  and  Pl^l^  we  have 

Pr[g,  = + 1)  = 


and  so 


(23) 


Pr((?2  = 2l 


I - PlK^.n  - P 


(2) 

A*  + 1 . 1 


^p 

,-0  " 


,!y,-2 


Using  equations  (19)  and  (23),  we  now  can  approximate  P,  2 for  / = 0,  1,  2 A'.  For 

7 > 3,  we  can  sequentially  develop  an  expression  for  P’r{C)2  = ./l  using  equation  (20)  and  the 
results  previously  generated  for  P'^\  n = 0.  1,  2 - I.  For  7 = 2 A',  one  gets 

A + l XV2-II 

1-  I I pj.r 

(24)  Pr|0^  = /)  = . 

and  Pr{C)2=  AT  + 1 ) = 1 - i;Prl02  = ,/). 

,-o 


In  a similar  fashion  we  can  use  equations  (18)  and  (19)  to  generate  an  approximation  to 
PrlOi  = C^2  = ./l  conditioned  on  the  class  1 customers  in  the  system,  denoted  by  P,j'\  In 

general,  it  turns  out  that  P,j'' ^ and  the  final  approximation  to  PrlC),  = /.  C>2”  ./1< 
denoted  by  P,  ,,  is  given  by 


(25) 


Pi  p,:r  + P2  p,'; 

Pi  + P2 


(2) 


where  p,  = £(.S’,),  / = 1,2. 
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The  steps  of  the  algorithm  can  now  be  summarized: 


STEP  1;  Use  equation  (20)  to  determine  P*r(02  = j)  for  j = 0, 1.  Use  equation  (19)  to 
determine  A.?*  for  j =0.1  and  / = 0,  1 K + 1. 

STEP  2:  For  ; = 2,  3 K use  equation  (24)  to  determine  PrliJj  = ,/l  and  equation 

(19)  to  determine  for  / = 0.  1,  ...  K + 2 — j. 


STEP  3:  Set  P'r{02 ^ + U = 1 “ LP''(02  = 71  and  use  equation  (19)  to  determine 
P dj<+i  and  P 

STEP  4:  Return  to  step  1 to  determine  the  approximation  to  Pr(Qi  = i.  = j\  based  on 
class  1, 


STEP  5;  Once  A'/*  A.?’  have  been  determined  set  P, , 


Pi  A.V’  + P2  A?* 
P1+P2 


Once  P,i  has  been  determined, 
use  of  standard  results. 


all  the  desired  measures  of  performance  can  be  obtained  by  the 


Several  comments  regarding  the  accuracy  of  the  algorithm  can  be  made.  For  the  case 
where  /(  = 0,  the  algorithm  is  exact.  We  have,  for  / = 0.  1 and  y = 0,  1, 


(26) 


'l(?2 


Pi 

1 + Pi 


and 

(27) 


pi 

1 + Pi 


Using  equation  (20),  for  j = 0,  1,  we  get 

-1 


Pr((32”7} 


--0  Pi\Q^-, 


I 

f 


Pi 

1 + Pi 


P2' 

1 + P2’ 


1 + P2 
Pi' 


so,  for  / “ 0,  1 and  y = 0,  1, 


I 


I 


I 
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p 


!*■ 


(29) 


P1P2' 

(1  + Pi)(l  + P2) 

Pi  P2' 

1 +P1  + P2  + Pi  P2  ’ 


which  is  equation  (1).  Analogously,  one  can  see  that  P/,"  = and  so  the  algorithm  gen 
erates  the  exact  results  for  the  case  when  A = 0. 


For  the_  cas^of  exponential  service-time  distributions  and  general  A",  section  2b,  it  can  be 
shown  that  P, , = P, , and  the  algorithm  generates  exact  results.  Using  the  results  of  section 
2b  to  obtain  an  expression  for  Pr{C>2  = y |C>  = /)  (for  / > 1 and  ,/  > 1),  we  have 

Pr{(?i  = /,  02  = 7) 


(30)  Pr{02  = 7l0,  = /I  = 


Pr(0,  = 1] 

Pflo  Pi  Pi 


PflO  Pi 


1 -P2' 


k+}-i 


= P2 


1 - P2 
(1  -P2) 

1 


But  equation  (30)  is  exactly  the  result  we  would  have  obtained  for  P,|y  by  using  the 
state  probabilities  for  an  M/M/l/K-F  1-i.  Thus,  P,|y^.,  is  exact  and  can  be  used  along  with 
quite  a bit  of  algebra  to  show  that  the  algorithm  generates  exact  results  for  this  case. 


In  the  next  section  we  give  numerical  comparisons  of  the  algorithm  for  the  situation 
where  K > 0 and  service  distribution  is  not  exponential.  There  the  algorithm  is  not  exact  but 
It  performs  very  well.  ’ 


4.  NUMERICAL  RESULTS 


In  the  first  part  of  this  section  we  give  some  numerical  comparisons  of  the  algorithm  for 
some  situations  where  A > 1 and  the  service  distributions  are  not  both  exponentially  distri- 
buted. The  section  closes  by  using  the  results  of  the  analysis  of  section  2b  to  answer  the  ques- 
tion as  to  whether  or  not  to  have  separate  waiting  facilities  for  each  class. 


Table  1 is  a comparison  of  the  algorithm  presented  in  section  3 and  the  special  case  of  K 
- I,  one  service-time  distribution  exponential,  and  the  other  £2  (section  2c)  For  a fixed  com- 
bined load  of  0.9,  comparisons  between  the  exact  (E)  and  the  approximate  (A)  results  are 
presented  for  various  measures  of  performance.  It  is  immediately  discernible  that  the  algorithm 
IS  quite  accurate  for  all  the  cases  presented.  In  a relative  sense,  the  algorithm  seems  to  be 
worse  when  the  traffic  intensities  are  equal.  Furthermore,  when  one  of  the  traffic  intensities  is 
zero  the  algorithm  gives  the  exact  results. 
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TABLE  1.  Comparison  of  algorithm  and  the  case 
where  one  service  rime  distribution  is  exponential 
and  the  other  Erlangian  (K  = 1,  Ip  \) 


Traffic 

Intensities 

E(Q\) 

^(02) 

PS, 

PB, 

COV  (Q^,  Qf) 

1 

2 

* 

. ...  . 

E 

0, 

0.92731 

0. 

0.27306 

0, 

0 

0.9 

A 

0. 

0.92731 

0. 

0.27306 

0. 

E 

0.10407 

0.84445 

0.02824 

0.23825 

-0.00511 

0.1 

0.8 

A 

0.10406 

0.84470 

0.02827 

0.23851 

-0.00474 

E 

0.21437 

0.75031 

0.05825 

0.20491 

-0.01563 

0.2 

0.7 

A 

0.21433 

0.75100 

0.5835 

0.20557 

-0.01474 

E 

0.32799 

0,64783 

0.08987 

0.17306 

-0.02557 

0.3 

0.4 

A 

0.32789 

0.64884 

0,09001 

0.17391 

-0.02456 

E 

0.44204 

0.53987 

0.12288 

0,14264 

-0,03098 

0.4 

0.5 

A 

0.44189 

0.54095 

0.12303 

0.14334 

-.03033 

E 

0.49834 

0,48469 

0.13982 

0.12789 

-0.03170 

0.45 

0.45 

A 

0,49819 

0.48571 

0.13997 

0.12842 

-0.03101 

E 

0.55373 

0.42914 

0.15702 

0.11343 

-0.03017 

0,5 

0.4 

A 

0.55358 

0,43003 

0.15714 

0.11374 

-0.03011 

E 

0.66049  • 

0.31804 

0.19199 

0.08510 

-0.02363 

0.6 

0.3 

A 

0.66038 

0.31861 

0,19207 

0.08502 

-0.02405 

E 

0.76002 

0.20857 

0.22750 

0.05720 

-0.01373 

0.7 

0.2 

A 

0.75997 

0.20881 

0.22753 

0.5693 

-0.01426 

E 

0.85036 

0.10223^ 

0.26323 

0.02909 

-0.00429 

0.8 

0.1 

A 

0.85035 

0.10227 

0.26323 

0.02909 

-0.00454 

E 

0.92989 

0. 

0.29889 

0. 

0. 

0.9 

0 

A 

0,92989 

0. 

0,29889 

0. 

0. 

E ” exaci;  A “ approxiniale 
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In  Table  2 a comparison  of  the  algorithm  and  the  results  of  a simulation  are  given  for  the 
case  K = 5,  with  one  service  distribution  Ei  and  the  other  E^.  For  various  combinations  of 
loads,  comparisons  between  the  simulation  (S)  and  the  approximate  (A)  results  are  presented 
for  various  measures  of  performance.  Although  the  approximation  does  very  well,  it  does  not 
appear  to  be  as  accurate  as  in  the  previous  comparisons.  To  some  extent,  this  may  be  attri- 
buted to  the  fact  that  this  comparison  was  against  a simulation,  with  its  own  inherent  inaccura- 
cies. However,  the  results  are  certainly  representative  and  the  approximation  appears  to  be 
quite  suitable  for  most  applied  work. 


TABLE  2.  Comparison  of  algorithm  and  the  case 
where  one  service-time  distribution  is  Ei 
and  the  other  £4  (K  = 5,  £(5,)  = E(Si)  = 1) 


Traffic 

Intensities 

• 

£(02) 

PB^ 

PB2 

1 

2 

S 

0.117 

0.103 

0.000 

0.000 

0.1 

0.1 

A 

0.108 

0.108 

0.000 

0.000 

S 

0.857 

0.108 

0.003 

0.000 

0.5 

0.1 

A 

0.854 

0.107 

0.003 

0.000 

S 

2.557 

0.114 

0.081 

0.005 

0.9 

0.1 

A 

2.539 

0.112 

0.077 

0.006 

S 

0.103 

2.656 

0.005 

0.073 

0.1 

0.9 

A 

0.107 

2.545 

0.007 

0.068 

S 

0.807 

2.350 

0.038 

0.070 

0.5 

0.9 

A 

0.766 

2.416 

0.040 

0.074 

S 

2.130 

1.980 

0.128 

0.120 

0.9 

0.9 

A 

2.055 

2.032 

0.121 

0.119 

•S  — simulation;  A — approximate 


Part  of  the  reason  for  investigating  this  queueing  system  was  to  try  to  answer  the  follow- 
ing question:  "Are  there  conditions  under  which  it  is  better  to  give  each  class  of  customer  his 
own  waiting  spaces  and  not  allow  the  other  class  to  use  them?"  We  have  used  the  results 
obtained  for  the  exponential  service-distribution  case,  section  2b,  to  investigate  this  question 
numerically.  Table  3 gives  the  results  of  this  investigation.  For  different  values  of  K,  three 
basic  cases  were  considered:  p,  > 1,  / - 1,  2;  p,  < 1,  ^ and  P,  < 1.  < “ 1,  2.  For 
Pi,  P2  and  K fixed,  the  total  load  carried  for  the  case  where  both  classes  are  allowed  to  use  all 
K spaces  (same)  is  compared  with  the  load  carried  for  the  case  where  the  K spaces  are 
optimally  divided  among  the  classes  and  a customer  may  only  use  the  spaces  allowed  to  his 
class  (sep  ).  The  optimal  allocation  of  the  K spaces  to  each  class  (given  in  parentheses)  was 
found  by  formulating  the  problem  as  a dynamic  programming  program,  where  the  objective 
function  was  to  maximize  the  total  load  carried  for  the  system.  From  the  table,  one  sees  that  if 
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both  traffic  intensities  are  greater  than  1,  use  of  separate  waiting  facilities  for  each  class  is 
optimal,  and  for  the  case  where  one  class’s  traffic  intensity  is  less  than  1 and  the  other’s  greater 
than  1,  the  solution  depends  on  K and  on  the  traffic  intensities.  For  the  case  where  both  traffic 
intensities  are  less  than  1,  it  is  better  for  both  classes  to  use  the  same  spaces.  It  should  be 
pointed  out  that  a more  detailed  study  of  the  optimal  operation  of  this  system  has  been  given 
by  Latouche  [10). 

TABLE  3 . Comparison  of  same  or  separate  waiting 
facilities  for  each  class  of  customers 


Total  load  carried  for  separate  and  same  spaces 
(and  optimal  allocation  for  separate  spaces) 


Hi 

K = 3 

K - 5 

K 

- 7 

K 

- 9 

mm 

Sep. 

Same 

Sep. 

Same 

Sep. 

Same 

Sep. 

Same 

1.1 

1.9 

1.6310 

(2,1) 

1.6150 

1.6820 

1.8330 

(4,3) 

1.7070 

1.7170 

1.2 

1.8 

1.6480 

(2,1) 

1.6390 

WrilBl 

mam 

1.7170 

1.855 

(4,3) 

1.7520 

mi 

1.7690 

1.3 

1.7 

msBM 

1.6560 

■l^ni 

matm 

1.7440 

1.8690 

(4,3) 

1.7890 

1.9130 

(5,4) 

1.8140 

1.4 

1.6 

mSSSM 

1.6670 

MBSm 

1.7620 

■MeUI 

KH 

1.8150 

1.8470 

1.5 

1.5 

1.6650 

(2,1) 

1.6690 

1.8010 

(3,2) 

1.7680 

1.8760 

(4,3) 

1.8240 

1.9210 

(5,4) 

1.8590 

0.1 

1.7 

1.0380 

(0,3) 

1.0420 

1.0730 

(0,5) 

1.0780 

1.0890 

(1,6) 

1.0900 

1.0960 

(1,8) 

1.0940 

0.3 

1.5 

1.1570 

(1,2) 

1.1890 

1.2320 

(1,4) 

1.2380 

msSm 

1.2580 

1.2660 

0.5 

1.3 

1.2670 

(1,2) 

1.3030 

MSam 

1.3740 

1.4100 

(2,6) 

1 .4070 

1.4250 

0.7 

1.1 

1.3280 

(1,2) 

1.3790 

1.4410 

(2,3) 

1.4760 

■nkife 

1.5310 

1.5550 

(4,5) 

1.5650 

0.9 

0.9 

1.3400 

(1,2) 

1.4060 

1.4650 

(2,3) 

1.5160 

1.5840 

KH 

1.6310 

0.1 

0.8 

0.7934 

(0,3) 

0.8009 

0.8378 

(0,5) 

0.8461 

0.8599 

(0,7) 

0.8685 

0.8750 

(1,8) 

0.8809 

0.2 

0.7 

0.8061 

(0,3) 

0.8337 

0.8535 

(1,4) 

0.8706 

0.8752 

(1,6) 

0.8863 

ini 

0.8936 

0.3 

0.6 

0.8210 

(1,2) 

0.8553 

0.8610 

(1,4) 

0.8847 

0.8828 

(2,5) 

0.8945 

0.8981 

0.4 

0.5 

0.8256 

(1,2) 

0.8659 

im 

0.8908 

itti 

0.8976 

0.8936 

(4,5) 

0.8994 

0.45 

0.45 

HEl 

0.8671 

liSi 

0.8914 

0.8850 

(4,3) 

0.8978 

mi 

0.8994 
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ABSTRACT 

A primal  simplex  procedure  is  developed  to  solve  transportation  problems 
with  an  arbitrary  additional  linear  constraint.  The  approach  is  a specialization 
of  the  Double  Reverse  Method  of  Charnes  and  Cooper.  Efficient  procedures 
for  pricing-out  the  basis,  determining  representations,  and  implementing  the 
change  of  basis  are  presented.  These  procedures  exploit  the  pure  transportation 
substructure  in  such  a manner  that  full  advantage  may  be  taken  of  the  compu- 
tational schemes  and  list  structures  used  to  store  and  update  the  basis  in  codify- 
ing the  MODI  method.  Furthermore,  the  pricing-out  and  change-of-basis  pro- 
cedures are  organized  in  a manner  that  permits  the  calculations  for  one  to  be 
utilized  in  the  other.  Computational  results  are  presented  which  indicate  that 
this  method  is  at  least  SO  times  faster  than  the  state-of-the-art  LP  code, 
APEX-lll.  Methods  for  obtaining  basic  primal  'feasible'  starts  and  'good'  feasi- 
ble integer  solutions  are  also  presented. 


I.  INTRODUCTION 


1 

\ 


In  this  paper  we  address  a linear  programming  problem  whose  constraints  consist  of  the  i ' 

standard  transportation  problem  plus  one  extra  constraint.  Such  linear  models  occur  frequently  I 

in  transportation  applications  and,  as  Dantzig  [5]  points  out,  the  extra  constraint  often  appears  I ^ 

in  the  form  of  a lower-  or  upper-bounded  partial  sum  of  some  of  the  activities.  Charnes  and  , f 

Cooper  [2]  suggest  that  in  many  applications  the  extra  constraint  may  simply  express  the  pro-  F 

portionality  or  equality  of  particular  variables  for  example,  forcing  two  routes  to  have  equal  f!  - 

flow.  The  warehouse  funds-flow  model  discussed  by  Thames  and  Cooper  [2]  is  a specific  appli-  I 

cation  which  may  appear  in  the  form  of  a transportation  problem  with  one  extra  constraint.  }., 

Another  application  is  a constrained  version  of  Wagner’s  employment-scheduling  problem  (20).  1^ 


•This  research  was  partly  supported  by  ONR  Contracts  NOOOI4-67-A-OI26-OOOg  and  N00014-67-A-OI 26-0009  with  the 
Center  for  Cybernetic  Studies,  The  University  of  Texas  at  Austin,  and  by  ONR  Contract  N0OOI4-76-C-O383  with  Deci- 
sion Analysis  and  Research  Institute.  Reproduction  in  whole  or  in  part  is  permitted  for  any  purpose  of  the  United 
States  Government. 
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Operations  research  literature  contains  a number  of  ingenious  techniques  for  transforming 
a transportation  problem  with  an  extra  linear  constraint  into  a larger  equivalent  transportation 
problem.  For  instance,  {2,3,4,5,13,15,17,18)  are  indicative  of  the  interest  in  this  problem. 
However,  these  transformations  are  not  possible  with  an  arbitrary  extra  constraint.  Thus,  in 
this  paper  we  develop  a primal  simplex-solution  procedure  which  exploits  fully  the  topological 
structure  embedded  in  this  problem.  One  of  the  main  purposes  is  to  specify  computationally 
efficient  procedures  for  determining  the  dual  evaluators  and  updating  the  basis.  Additionally, 
an  efficient  method  for  finding  the  representation  of  any  entering  vector  with  respect  to  the 
current  basis  is  developed.  Thus  our  solution  method,  like  the  MODI  method  [5]  and  the 
Row-Column  Sum  Method  [2],  only  requires  the  knowledge  of  the  basis  rather  than  the  basis 
inverse  to  be  able  to  implement  the  simplex  method  efficiently.  Furthermore,  these  procedures 
exploit  the  structure  in  such  a manner  that  they  allow  us  to  take  full  advantage  of  the  computa- 
tional schemes  and  list  structures  (1,7,9,12)  used  to  store  and  update  the  basis  in  codifying  the 
Row-Column  Sum  Method.  Our  approach  also  allows  for  the  use  of  advanced  start  procedures. 
For  instance,  we  show  how  the  Row  Minimum  Rule  (6)  and  Vogel’s  Approximation  Method 
(17)  for  the  pure  transportation  problem  can  be  extended  to  yield  good  starts  for  the  singly  con- 
strained problem. 

Since  the  extra  constraint  may  destroy  the  integrality  property  of  the  pure  transporation 
problem,  an  optimal  solution  may  be  noninteger.  Thus  another  purpose  of  this  paper  is  to  indi- 
cate an  efficient  way  to  find  a "good"  integer  solution  to  the  singly  constrained  transportation 
problem. 

Our  solution  procedure  for  the  singly  constrained  transportation  problem  may  be  viewed 
as  a specialization  of  the  Double  Reverse  Method  of  Charnes  and  Cooper  (2).  Further,  our 
approach  uses  this  pedagogical  overview  to  exploit  the  subproblem  structure  of  the  singly  con- 
strained transporation  problem  by  viewing  the  problem  in  a graphical  context.  In  particular,  the 
basis  partitioning  simplex  algorithm  is  adapted  to  solve  this  class  of  problems  by  storing  the 
coefficient  matrix  and  the  basis  matrix  as  graphs  using  computer  list  structures. 

The  use  of  such  structures  reduces  both  the  amount  of  work  required  to  perform  the  basic 
simplex  steps  and  the  amount  of  computer  memory  required  to  store  essential  data.  The 
matrix  operations  of  finding  the  representation  of  an  entering  vector  and  determining  updated 
dual  variable  values  are  performed  by  tracing  paths  within  the  basis  graph.  Since  the  graphs 
contain  only  the  nonzero  entries  in  the  problem  (basis)  the  list  procedures  (1,7,9,12)  eliminate 
checking  or  performing  unnecessary  arithmetic  operations  on  zero  elements.  In  addition,  by 
exploiting  the  near-triangular  basis  properties  of  such  problems,  this  approach  allows  the  basis 
inverse  to  be  stored  implicitly  as  a graph.  This  graph  is  updated  during  basis  exchange  steps  by 
simply  changing  a few  pointers  in  the  list  structures.  This  entirely  eliminates  the  arithmetic 
operations  normally  used  to  update  the  basis  inverse  and  also  eliminates  round-off  error. 

The  advantages  of  this  approach  are  dramatically  illustrated  by  the  computational  results 
in  section  7.  These  results  indicate  that  this  approach  is  at  least  50  times  faster  than  the  state- 
of-the-art  linear  programing  techniques  contained  in  APEX-III. 


2.  TOPOLOGICAL  PROPERTIES  OF  THE  SINGLY  CONSTRAINED 
TRANSPORTATION  PROBLEM 

The  singly  constrained  transportation  problem  may  be  stated  as 
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t 

i 
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(1) 


minimize  ^ ^ c,i  + 05 

/-I  /-I 


subject  to  ^x,j  - a.,  , - 1.  2 m,  a,  > 0 

/-I 

m 

^ x,j  - bj,  j - \,  2,  , n,  b,  0, 

/-I 

Xy  > 0 for  all  / and  j. 


(2) 


1L  - S ~ k (where  some  /y  may  be  zero), 

/-I  /-i 


5^0, 


The  dual  of  this  problem  is 


as  we  assume  ^ a,  ^ £ (>/• 

/-I  /-I 


maximize  ^ a,/(,  + ^ b^Ki  + kb 

/-I  /-i 

(3)  subject  to  /i,  + Af,  + /y  6 < Cy,  / - 1.  2 m, 

8 > 0. 


/?/,  — unrestricted  for  all  / and  j. 


Graphically,  the  problem  may  be  portrayed  as  a bipartite  graph  whose  arcs  have  "flap” 
corresponding  to  the  coefficients  in  constraint  (2).  In  particular,  the  arc  incidence  conditions, 
as  defined  by  (1),  define  a transporation  network  and  the  extra  constraint  (2)  is  simply  viewed 
as  placing  Joint  restrictions  over  the  arcs.  Because  of  constraint  (2),  certain  columns  of  the 
coefficient  matrix  will  have  three  nonzero  entries;  the  arcs  corresponding  to  these  columns  are 
called  the  arcs  with  flags,  or  "flagged"  arcs. 

One  of  the  columns  of  the  coefficient  matrix  has  only  one  entry.  This  column 
corresponds  to  the  slack  variable  5 which  appears  in  (2).  The  slack  variable  is  another  special 
type  of  arc  which  may  be  viewed  as  being  attached  to  any  node.  For  convenience,  we  assume 
that  it  is  attached  to  node  1.  In  the  literature,  a slack  arc  has  been  referred  to  as  a slack  loop 
(see  Dantzig  [S]).  The  loop  structure  inherent  in  the  singly  constrained  transportation  problem 
will  play  an  important  role  in  the  subsequent  computational  development.  Figure  I illustrates 
the  complete  graph  structure  of  a 2 x 3 singly  constrained  tfansporation  problem.  The  number 
appearing  in  the  rectangle  on  each  arc  corresponds  to  the  cost  while  the  number  appearing  in 
the  semicircle  corresponds  to  the  flag  value  associated  with  the  arc. 

A basis  for  the  singly  constrained  transportation  problem  has  a structure  which  is  closely 
related  to  the  pure  transportation  problem.  It  is  generally  known  that  the  basis  of  a pure  tran- 
sportation problem  is  a spanning  tree  [S],  and  the  basis  has  rank  m -I-  />  - 1.  Since  the  extra 
constraint  (2)  involves  the  slack  variable  5,  the  singly  constrained  transportation  problem  will 
clearly  have  a basis  of  rank  m + n.  The  following  remark  relates  the  basis  structure  of  the  pure 
and  singly  constrained  problems;  it  also  leads  to  a complete  characterization  of  the  basis. 
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Figure  I.  illustration  of  graph  structure  of 
2x3  singly  constrained  problem. 


REMARK  1:  Consider  a basis  of  m + n columns  (arcs)  for  the  singly  constrained  tran- 
sporation  problem.  Delete  from  these  columns  (arcs)  the  component  corresponding  to  con- 
straint (2).  From  the  set  of  m + n reduced  columns  (arcs),  there  exists  m + n — I columns 
(arcs)  which  form  a basis  for  the  underlying  pure  transportation  problem. 

PROOF:  Let  B be  any  basis  for  the  singly  constrained  transportation  problem.  B is  an 
(/w  -F  n + 1)  X (m  -I-  n)  matrix.  Delete  the  row  of  S corresponding  to  the  components  associ- 
ated with  the  extra  constraint  (2);  the  rank  of  the  resulting  matrix  B'  is  at  least  m + n - \ 
since  the  row  rank  of  B is  m -f  n.  Furthermore,  since  the  rank  of  the  coefficient  matrix  of  the 
pure  transportation  problem  is  m + n — 1,  the  row  rank  of  B'  is  at  most  m + n — Thus  the 
rank  of  B'  is  m + n - Hence  there  exists  m + n - \ independent  arcs  among  the  m -F  n 
arcs  in  B'.  By  definition  these  m + n — I arcs  form  a spanning  tree  for  the  pure  transportation 
problem. 

It  is  a fundamental  property  of  spanning  trees  that  a spanning  tree  plus  one  additional  arc 
incident  on  the  tree  nodes  contains  precisely  one  loop  [14,18].  The  basis  for  the  singly  con- 
strained transporation  problem  can  assume  one  of  two  possible  loop  structures.  In  one  case  the 
basis  contains  m + n arcs  which  are  indigenous  to  the  underlying  pure  transporation  problem. 
In  this  case  the  basis  can  be  viewed  as  a tree  with  an  extra  arc.  In  the  other  case,  the  basis  con- 
tains the  slack  arc  and  m -F  n — 1 arcs  which  are  indigenous  to  the  pure  transportation  prob- 
lem. In  this  case  the  basis  can  be  viewed  as  a slack  loop  with  an  attached  tree. 

In  the  next  sections  an  efficient  primal  simplex  algorithm  is  developed  for  the  singly  con- 
strained transportation  problem.  The  efficiency  of  this  algorithm  is  derived  from  exploiting  the 
near-triangular  structure  of  the  basis  in  a graphical  context.  Specifically,  we  view  the  basis  as 
being  stored  as  a rooted  tree  plus  an  extra  arc  using  the  efficient  list  structures  and  labeling  pro- 
cedures [1,7,9,12]  for  representing  a spanning  tree  as  a rooted  tree.  These  list  structures  and 
labeling  techniques  provide  the  means  for  performing  the  two  types  of  tree  traversal  needed  to 
execute  the  basic  simplex  steps. 


Specifically,  these  lists  allow  one  to  find  efficiently  the  unique  path  in  a spanning  tree  join- 
ing one  node  to  another  node.  This  feature  allows  one  to  find  easily  the  loop  (called  the  basis 
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loop)  created  in  the  spanning  tree  by  the  extra  basis  arc.  Similarly,  the  loop  created  in  the 
spanning  tree  by  adding  an  entering  nonbasic  arc  is  called  a nonbasis  loop.  In  section  4,  we 
show  that  the  representation  of  any  nonbasic  arc  in  terms  of  the  basis  can  be  easily  determined 
by  reference  to  the  basis  loop  arcs  and  nonbasis  loop.  Additionally,  these  lists  can  be  used  to 
locate  all  nodes  situated  below  a given  node  in  the  rooted  tree.  (Note  that  this  discussion 
assumes  that  the  root  is  the  highest  node  in  the  tree.)  This  feature  is  used  to  update  the  dual 
variable  values  quickly  and  efficiently. 

Dual  variable  values  are  updated  in  some  cases  by  locating  all  nodes  situated  "below"  a 
given  node  of  the  arc  leaving  the  basis.  In  other  cases,  when  all  the  dual  variables  values 
change,  this  may  be  efficiently  done  by  starting  at  the  root  of  the  tree  and  visiting  each  node  of 
the  tree.  The  efficiency  of  this  approach  derives  from  the  facts  that  only  nonzero  coefficients 
are  stored,  that  operations  involving  a basis  matrix  inverse  may  be  performed  using  the  basis 
graph,  and  that  basis  exchange  steps  involve  only  the  updating  of  list  structure  pointers. 


3.  EFFICIENT  IMPLEMENTATION  OF  THE  PRICING-OUT  PROCEDURE 

In  this  section  computationally  efficient  procedures  for  determining  the  dual  evaluator 
values,  and  thus  the  updated  costs,  are  presented.  Pricing  out  the  basis  (that  is,  determining 
the  dual  evaluator  values)  is  equivalent  to  finding  the  simultaneous  solution  of  the  equality 
form  of  the  dual  constraints  in  (3)  associated  with  the  primal  variables  in  the  basis.  The  result- 
ing set  of  dual  equations  forms  an  (m  + n)  x (m  + n + ])  system  of  equations.  Our  pro- 
cedure for  determining  the  dual  evaluators  greatly  simplifies  the  solution  of  the  (m  + n)  x 
(w  -E  n -I-  1)  system  of  dual  equations.  The  procedure  is  a two-step  approach  in  which  the 
value  of  the  dual  evaluator,  8,  is  initially  obtained  and  the  values  of  the  remaining  dual  evalua- 
tors are  subsequently  determined. 

In  the  previous  section,  we  noted  that  there  are  two  types  of  basis  graph  structures.  In 
one  case  the  graph  contains  a loop  consisting  of  a single  slack  arc;  in  the  other  case  the  loop 
consists  of  a sequence  of  an  even  number  of  arcs. 

For  a basis  graph  which  includes  a slack  (artificial)  arc,  the  Complementary  Slackness 
Theorem  implies  that  the  dual  evaluator,  8,  associated  with  the  extra  constraint,  can  be 
immediately  priced  to  zero  (M).  Since  the  remaining  basic  arcs  form  a spanning  tree,  the 
remaining  dual  evaluators  may  be  efficiently  determined  if  the  basis  has  been  stored  as  a rooted 
tree  using  the  labeling  in  [1,7,9,12].  Thus,  one  can  start  at  the  root  of  the  tree,  setting  its  dual 
variable  to  zero,  and  sequentially  visit  ail  nodes  in  the  tree  using  the  dual  equations  to  calculate 
their  values. 

On  the  other  hand,  a graph  which  contains  a "nonslack"  basis  loop  requires  a more  ela- 
borate treatment.  (For  editorial  convenience  we  shall  denote  such  a loop  by 
1(1, 1),  (2, 1),  (2,2),  (3,2),  . . .,  (.p.p),  (l,p)].  Figure  2 illustrates  this  loop  with  associated  costs 
c,i,  flag  values  /„,  and  dual  evaluators  8,  R„  and  Kj  for  each  appropriate  node.)  This  case  may 
be  handled  by  first  determining  the  value  of  the  dual  evaluator  8 and  then  pricing  out  the 
rooted  tree  in  a manner  analogous  to  the  slack-loop  case  to  find  the  /?,  and  A,  dual  evaluators. 

Thus,  maintaining  a partition  of  the  basis  as  a rooted  spanning  tree  and  one  other  arc 
enables  us  to  determine  immediately  the  remaining  dual  evaluators  once  the  value  of  8 has 
been  determined.  Further,  we  show  that  the  dual  evaluator  8 can  be  easily  determined  by  con- 
sidering only  the  dual  equations  associated  with  basic  variables  in  the  basis  loop. 


i 


Figure  2.  a nonslack  loop  with  associated  dual  evaluators. 


Since  the  dual  constraints  associated  with  the  basic  arcs  are  binding  at  optimality,  the  dual 
evaluators  /?,,  Kj  and  S associated  with  the  loop  in  Figure  2 may  be  found  by  solving  the  sys- 
tem of  simultaneous  equations; 

(4)  /?|-F/k,  = 

/k  I + + /21  8 “ C21 

^2+^2  + /22  * “ ^'22 

^2+^3  + /32  S ” <^32 


SINGLY  (ONSTKAINII)  IK  ANSIA  IK  I A I H »N  MKi 


PROOF:  The  entire  system  of  dual  eguations  associated  anih  ilic  ba*  < is~  . nl 

freedom.  Thus  it  will  be  assumed,  with  loss  of  generality  that  ft  • 0 lisa  gonii  Misulves 
solving  the  first  equation  in  (4)  for  A | in  terms  of  A and  then  performing  the  soissiiiu 

tion  into  the  next  equation  in  (4),  solving  repeatedly  lor  each  successive  vaiiaMc  in  terms  at 
the  preceding,  and  hence  in  terms  of  8. 

Solving  the  first  equation  in  (4)  for  A | in  terms  of  8,  we  obtain 

<6)  A-  r A 

- t ii  - /ii 

Substituting  (6)  into  the  second  equation  in  (4),  we  have 

/?2-(cj|-/j,  8)-(C||-/,,8). 

Using  (7)  in  the  third  equation  yields 

(8)  ^2  ” (<^22  “ /22  8)  + ~ /ii  ~ (f2i  ~ /21 

which  secures 

A 3 — (C32  — /32  8)  + {C21  — /21  8) 

“ (^22  ~ /22  8)  “ (C||  ~ /||  8). 

Thus,  repeated  performance  of  these  operations  yields, 

(10)  = f (c„.,  - S)  -f  (c„  - f„  6), 


/-2 


/-2 


(11) 


t 8)  -t  (r„_,  - f„. 


8). 


/-2 


Focusing  attention  on  the  last  equation  of  (4)  yields 


f\r  8 - t I,  - t fn- 


(12) 


8 + i + C| 
(-2 

P 


p-t 


^ .4,  - ^ .4,-1  - .4|;,  8 = ^ c„  - ^ c„.|  - 
'-I  ,-2  ,-i  ,-2 


Assuming  that  ^/„  - ^ .4,.|  - /i^  is  not  equal  to  zero,  expression  (12)  yields  the  desired 
,-i  ,-2 

result.  By  focusing  attention  on  the  loop  portion  of  the  basis,  we  may  show  that  this  assump- 
tion is  valid.  In  particular,  we  form  an  "alternating"  linear  combination  of  the  columns  associ- 
ated with  the  loops  where,  in  traversing  the  loop  clockwise,  the  loop  arcs  traversed  in  the  for- 
ward direction  are  multiplied  by  -I- 1 and  those  traversed  in  the  backward  direction  are  multi- 
plied by  -I.  The  resulting  linear  combination  is 


(13) 


+ IF,,  - IF21  + l/’22-  1^32+  - P 


■ r 


^/„  “ ~ f\i 


,-i  ,-2 
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where  P,,  denotes  that  column  of  the  coefficient  matrix  associa'ed  with  variable  x,,.  Thus,  if 
^ — J\p  is  zero,  then  the  loop  arcs  would  be  linearly  dependent,  a contradiction, 

i-l  1-2 

thus  completing  the  proof. 

The  graphical  technique  proposed  by  the  theorem  for  calculating  the  value  of  the  dual 
variable  8 may  be  described  in  operational  terms  as  follows.  Keep  the  partitioned  basis  stored 
as  a spanning  tree  and  an  extra  arc.  Starting  at  origin  loop  node  1 , set  dual  variable  /? , = 0. 
Next,  traverse  th;  basis  loop  in  a clockwise  direction.  While  traversing  the  basis  loop  keep  a 
sum  and  a sum  f"*  of  the  cost  coefficients  and  flag  values,  respectively,  of  the  arcs  traversed 
in  a forward  direction.  Likewise,  form  a sum  C*  and  a sum  F~  of  the  cost  coefficients  and  flag 
values,  respectively,  of  the  arcs  traversed  in  a backward  direction.  Then 
(C^  - C~)  / - F ) is  the  desired  value  of  8. 

From  a count  of  operations  it  is  not  difficult  to  see  that  the  foregoing  technique  consti- 
tutes the  most  efficient  means  for  computing  8 via  expression  (5).  After  calculating  the  value 
8,  start  at  the  root  of  the  tree,  setting  its  dual  evaluator  to  zero.  Whereupon,  the  other  dual 
evaluators  are  immediately  determined  in  a cascading  fashion  if  we  move  down  the  tree  and 
identify  the  value  for  each  /?,  or  K,  from  its  predecessor  by  using  the  equation 
R,  + K,  + (>/,!  = c,i.  Highly  efficient  labeling  procedures  for  traversing  the  tree  to  initialize  and 
update  these  dual  evaluators  are  described  in  [1,7,9,12]. 


4.  CHANGE  OF  BASIS  AND  UPDATING  PROCEDURES 

In  this  section  we  present  computationally  efficient  procedures  for  implementing  the 
change  of  basis  proceouie  and  for  updating  the  dual  evaluators  as  well  as  the  basic  flows.  The 
discussion  provides  a connection  between  the  calculation  of  8 in  the  preceeding  section  and  cal- 
culations involved  in  the  change  of  basis,  enabling  the  latter  to  be  carried  out  with  marginal 
additional  effort. 

In  order  to  execute  a change  of  basis  it  is  necessary  to  find  the  representation  of  a non- 
basic  arc  {r,  s)  in  terms  of  the  basic  arcs.  Because  of  the  structure  and  previous  calculations, 
the  determination  of  the  representation  is  relatively  simple.  We  have  three  cases: 

CASE  1:  Suppose  the  arc  (r,  s)  is  the  slack  arc.  (Let  us  denote  the  column  of  the 
coefficient  matrix  associated  with  arc  (r,  s)  as  P„  and  let  P„  denote  that  portion  of  P„  associ- 
ated with  the  underlying  pure  transportation  problem;  i.e.,  is  an  m -(-  n -t-  1 component  vec- 
tor (since  it  contains  a flag  component)  whereas  P„  has  only  m + n components.)  Prior  to  the 
entrance  of  the  slack  arc  in  the  basis,  the  basis  loop  must  consist  of  a "nonslack"  loop.  Then 
from  (13)  we  can  find  the  representation  of  the  slack  arc  by  solving  for  0 such  that 


0 


0 

F^  - F- 


(14)  0 


— —e 


m + f 


(where  e'"*^"*^'is  the  m+n  + l column  of  an  identity  matrix). 
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Clearly, 

- I / — F—)  (Note  that  we  are  assured  0 does  not  equal  zero.) 

Thus  the  slack  arc  is  represented  solely  in  terms  of  the  basis  loop  arcs  and  this  representation  is 
given  by 

(16)  + f^ii  - 0^21  + - ...-eP,^  = 

Note  that  no  additional  calculations  are  required,  since  - F'  was  determined  in  the  pricing- 
out  procedure  of  the  preceding  section.  Further  note  that  the  coefficients  in  the  representation 
are  equal  in  magnitude.  Thus  to  determine  the  column  vector  (arc)  to  leave  the  basis  only 
requires  checking  the  sign  of  0 and  then  finding  the  minimum  flow  value  associated  with  the 
loop  arcs  traversed  in  a forward  (or  backward)  direction  if  0 is  positive  (or  negative).  Thus  no 
ratios  have  to  be  computed  to  find  the  vector  to  leave  the  basis.  Also,  the  flow  values  associ- 
ated with  the  basis  can  be  immediately  updated  with  a minimum  of  computational  effort  by 
simply  adding  or  subtracting  the  current  flow  value  of  the  arc  leaving  the  basis  from  the 
appropriate  basis  loop  arcs  and  setting  the  flow  on  the  entering  slack  arc  to  this  value  divided  by 
0-  All  other  basic  flows  remain  unchanged. 


Since  the  entrance  of  a slack  arc  in  the  basis  causes  both  the  value  of  8 and  basis  loop 
structure  to  change,  the  dual  evaluators  associated  with  the  new  basis  cannot  be  conveniently 
updated,  thus,  the  dual  evaluators  would  have  to  be  calculated  as  in  section  2. 


CASE  2:  Now,  suppose  that  the  incoming  arc  (r,s)  is  not  a slack  arc,  but  that  the  slack 
arc  is  contained  in  the  basis.  By  Remark  1,  the  current  basis  may  be  partitioned  into  a spanning 
tree  and  the  slack  arc.  Focusing  only  on  the  spanning  tree  portion  of  the  basis,  we  can  deter- 
mine the  unique  undirected  path  which  leads  from  origin  node  r to  destination  node  s Such  a 
path  may  be  denoted  ((zj,),  (/„y,),  (/„  yj),  ...  (/^,.,)).  (Note  this  path  will  contain  an  odd 
number  of  arcs.)  We  make  the  following  observation: 


(17) 


' P -IF  -I-  1 F - 

'•'1  ‘'mm  ^ "m'i 


+ 1 F = 


- -^MM  + 


Le.,  the  linear  combination  formed  by  multiplying  arcs  in  the  path  which  are  traversed  in  the 
forward  direction  by  -t-1  and  arcs  in  the  path  which  are  traversed  in  the  backward  direction  by 
-1  yields  a vector  which  equals  F „ is  all  components,  except  possibly  the  w -E  « -)-  1st  Thus 
we  use  the  slack  basis  loop  arc  and  the  results  of  (17)  to  form  the  following  equation: 


F/i  /,,,,  + - . . . + / 


0 

(f„| 

+ 0 

- / 

0 

-1 

\Jrs\ 

(18) 

Solving  yields 

(1^1  0’“  — f-¥f—  -t-/' 

Thus,  with  the  value  of  0 as  computed  in  (19),  we  have  representation  of  arc  (z,  j)  as 

+ l^n,-  IF,,„+  lA„,+  ...+  + - F„ 

where  Fy  is  the  column  vector  corresponding  to  the  slack  variable  S. 
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The  determination  of  the  column  vector  (arc)  to  leave  the  basis  is  again  amenable  to 
computational  simplification;  we  have  two  cases.  If  9 in  (20)  is  less  than  or  equal  to  zero,  the 
computation  is  particularly  simple.  The  determination  of  the  column  vector  (arc)  to  leave  the 
basis  only  requires  finding  the  minimum  flow  value  associated  with  the  unique  path  arcs 
I which  are  traversed  in  a forward  direction.  The  arc  associated  with  the  minimum  flow  value 

> is  the  arc  to  leave  the  basis.  The  basic  flows  may  be  updated  by  assigning  a flow  of  Kp  to  the 

incoming  arc,  adding  Yp  to  the  flow  on  the  "backward  arcs,"  subtracting  Yp  from  the  flow  on  the 
t "forward  arcs"  in  the  unique  path,  and  adding  —9  Yp  to  the  flow  on  the  slack  arc. 


If  9 in  (20)  is  greater  than  zero,  we  need  to  compute  two  values.  Let  Yp  be  calculated  as 
above,  and  let  Y^  equal  the  flow  on  the  slack  arc  divided  by  9.  Then  the  arc  to  leave  the  basis 
will  be  the  arc  whose  flow  value  is  equal  to  the  minimum  of  Yp  and  If  the  arc  to  leave  the 
basis  is  in  the  unique  path,  then  the  flows  may  be  updated  as  above  (i.e.,  as  in  the  previous  , 

case  when  9^0).  If  the  outgoing  arc  is  the  slack  arc,  then  we  add  and  subtract  Ty  from  the 
appropriate  arcs  in  the  unique  path.  All  other  basic  flows  remain  unchanged. 

In  general,  when  the  arc  to  leave  the  basis  is  not  contained  in  the  basis  loop,  then  it  is 
computationally  advantageous  to  update  the  dual  evaluators  rathei  than  to  calculate  them 
directly  as  in  section  2.  Thus  when  the  incoming  arc  does  not  replace  the  slack  arc,  we  will 
update  the  dual  evaluators.  In  this  case  the  origin  node  k and  the  destination  node  t of  the  out- 
going arc  (k,  t)  will  be  contained  in  the  spanning-tree  portion  of  the  basis.  Deleting  the  arc 
(k,  i)  from  the  basis  splits  the  spanning  tree  into  two  disjoint  subtrees,  say  T,  and  Tj.  One  of 
these,  say  T),  is  attached  to  the  loop  and  is  disjoint  from  the  remainder  of  the  basis.  All 
dual  evaluators  associated  with  nodes  not  in  (including  8)  will  remain  unchanged.  To 
update  the  dual  evaluators  associated  with  nodes  in  Tj,  we  compute 

(21)  c - /?,  + A,  -t-  8/„  - c„.  I 

which  is  the  current  updated  cost  associated  with  the  incoming  arc  (r.s).  The  dual  evaluators  in  | 

Ti  may  be  updated  in  the  following  manner.  If  r€  Tj,  subtract  cfrom  R,  anu  all  other  such  i 

that  kiTi-  Further,  add  c to  all  dual  evaluators  A,  such  that  If  sETj,  subtract  cfrom  > 

A,  and  all  other  A/  such  that  T^.  Further,  add  c to  all  dual  evaluators  R,  such  that  /€  Ti-  A ■ 

proof  of  the  validity  of  the  preceding  updating  procedure  follows  straightforwardly  from  the 

facts  established  in  the  papers  [1,7,9,12).  ( 


CASE  3;  We  finally  consider  the  case  in  which  the  arc  (r.s)  tc  enter  the  basis  is  not  a 
slack  arc  and  a slack  arc  is  not  contained  in  the  current  basis.  Again  we  partition  the  basis  into 
a spanning  tree  plus  one  additional  arc.  Using  the  results  of  (13)  and  (17)  we  obtain 


(22) 


Solving  yields 
(23) 

Thus, 


- F F*  - F- 
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and  the  representation"  of  is  given  in  (24).  Again  little  additional  computation  is  required, 
since  F*  - f"  has  previously  been  determined,  and  computationally  efficient  methods  of  deter- 
mining the  unique  path  are  available  [1,7,9,12]. 

In  determining  the  column  vector  (arc)  to  leave  the  basis  and  subsequent  flow  changes, 
we  first  note  that  some  of  the  column  vectors  (arcs)  in  (24)  may  appear  twice.  That  is,  some 
of  the  column  vectors  (arcs)  in  the  unique  path  may  also  be  contained  in  the  basis  loop.  When 
the  column  vectors  (arcs)  in  (24)  appear  only  once,  certain  computational  simplifications  arise. 
In  this  case,  we  have  two  subcases.  First,  if  9 - 0 we  have  basically  the  same  situation  as  in 
case  2 when  0^0.  Thus  the  change  of  basis  procedure  would  be  performed  in  a manner 
analogous  to  case  2.  Also,  calculate  K;^,  the  minimum  flow  associated  with  a "forward  loop 
arc,"  and  Ky',  the  minimum  flow  associated  with  a "backward  loop  arc."  If  > 0 (0  < 0), 
compare  with  K,'*’  / 0 (—Yf!  0).  If  is  the  smaller  of  the  two  values,  then  the  vector 
(arc)  to  leave  the  basis  is  in  the  unique  path.  The  flows  may  be  updated  by  assigning  a flow  of 
f'p  to  the  incoming  arc  (r,  s),  adding  or  subtracting  from  the  appropriate  unique  path  arcs, 
and  adding  or  subtracting  0 from  the  appropriate  loop  arcs.  (Notice  that  in  this  case  Y,^  and 
Yi~  may  be  easily  updated.)  If  is  the  larger  of  the  two  values,  then  the  vector  (arc)  to  leave 
the  basis  is  in  the  loop.  When  0 > 0 (0  < 0)  the  incoming  arc  (r,  s)  is  assigned  a flow  of 
Yi^  I 0 {-Yr  I 0),  and  Y*  i-Y,')  is  added  to  the  appropriate  loop  arcs.  Similarly, 
Yi^  / 0 (-  Yf  / 0)  is  added  to  the  appropriate  arcs  in  the  unique  path.  All  other  basic  flows 
remain  unchanged. 

When  the  vectors  (arcs)  in  the  unique  path  are  contained  in  the  loop  portion  of  the  basis, 
no  simplifications  arise.  (Note  that  in  this  case  0 ^ 0.)  In  this  case  the  "usual"  linear  program- 
ming pivoting  process  must  be  used  to  determine  the  vector  (arc)  to  leave  the  basis  and  the 
updated  basic  flows. 

Again,  we  may  consider  updating  the  dual  evaluators.  If  the  vector  (arc)  to  leave  the 
basis  is  in  the  loop,  then  updating  is  not  computationally  advantageous  and  the  direct  method 
of  calculating  the  dual  evaluators  should  be  applied.  However,  if  the  vector  (arc)  leaving  the 
basis  is  not  in  the  loop  portion  of  the  basis  we  may  update  the  dual  evaluators  in  a manner 
completely  analogous  to  the  updating  in  case  2. 

The  previous  procedures  for  determining  representations  and  updating  basic  flows  and 
dual  evaluators  have  been  designed  to  minimize  computational  effort.  However,  another 
important  advantage  of  these  procedures  involves  the  accumulated  round-off  error.  The 
pricing-out  procedure  minimizes  round-off  error  in  that  the  calculation  of  6 contains  all  of  the 
round-off  error;  all  other  dual  evaluators  (whether  calculated  directly  or  by  updating)  are 
derived  from  original  parameters  by  addition  and  subtraction.  Likewise,  the  calculation  of  the 
flow  values  minimizes  round-off  error.  However,  in  successive  pivoting  some  error  may  accu- 
mulate, and  it  may  be  desirable  to  generate  an  exact  representation  of  the  original  stipulating 
vector. 


5.  OBT  MNING  A BASIC  PRIMAL  FEASIBLE  SOLUTION 

We  now  consider  the  problem  of  obtaining  a basic  "feasible"  start  for  the  singly  con- 
strained transportation  problem.  The  problem  is  easily  handled  by  extending  the  primal  start 
algorithms  which  exist  for  the  pure  transportation  problem.  A variety  of  start  algorithms  exists 
for  obtaining  a basic  feasible  start  for  the  pure  transportation  problem.  Primal  start  methods 
include  the  Northwest  Corner  Rule  [5],  Vogel  Approximation  Method  (VAM)  [17],  Row 


I 
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Minimum  Rule  (61,  Row  Column  Minimum  Rule  (5J,  Modified  Row  Minimum  Rule  |8J, 
^Matrix  Minimum  Rule  (13),  and  Matrix  Minimum  Rule  by  Ranking  Method  116).  Another 
way  of  obtaining  a good  basic  start  is  to  use  an  optimal  solution  to  the  underlying  pure  tran- 
sportation problem. 

REMARK  2:  A basic  "feasible"  solution  for  the  singly  constrained  transportation  problem 
may  be  obtained  by  applying  one  of  the  above  starting  methods  to  the  underlying  pure  tran- 
sportation problem  and  then  adding  an  appropriate  slack  or  artificial  (as  determined  by  (2))  to 
the  pure  transportation  starting  basis. 

PROOF:  Since  the  pure  transportation  start  yields  a feasible  spanning  tree  for  constraints 
(1)  of  the  singly  constrained  problem,  we  need  only  augment  the  pure  starting  basis  by  an  extra 
variable  (slack  or  artificial)  whose  value  will  satisfy  (2).  Clearly  the  augmented  vectors  are 
linearly  independent  and  satisfy  (1)  and  (2).  Thus  a basic  solution  is  easily  determined  which 
satisfies  primal  feasibility,  possibly  in  an  artificial  sense. 


6.  OBTAINING  A FEASIBLE  INTEGER  SOLUTION 

An  optimum  solution  to  a singly  constrained  transportation  problem  (as  with  most  linear 
programming  problems)  may  be  noninteger.  In  this  section  we  present  a simple  means  of 
obtaining  a "good"  feasible  integer  solution  to  the  singly  constrained  transportation  problem. 
(Throughout  the  section  we  assume  that  the  transporation  parameters  are  integer,  that  is,  we 
assume  that  the  cost  coefficients  and  supply  and  demand  amounts  are  integer.)  Such  an  integer 
solution  may  serve  as  an  end  in  itself  or  provide  an  upper  bound  on  the  optimum  integer  solu- 
tion. If  an  optimum  integer  solution  is  desired,  then  a relatively  tight  upper  bound  should  pro- 
vide a useful  tool  for  implementing  a branch-and-bound  solution  procedure.  Furthermore,  the 
feasible  integer  solution  may  be  obtained  with  little  additional  computation  beyond  the  nonin- 
tegral optimum. 

REMARK  3:  A feasible  integer  solution  to  the  singly  constrained  transportation  problem 
may  be  obtained  by  pivoting  the  slack  variable  (in  constraint  (2))  into  the  optimal  basis. 

PROOF.  Of  course,  if  the  slack  variable  is  in  the  optimal  basis,  then  the  optimal  solution 
is  integer,  and  no  additional  work  is  necessary.  Thus,  it  is  only  necessary  to  show  that  it  is 
always  possible  to  pivot  the  slack  variable  into  the  optimal  basis  and  maintain  primal  feasibility. 
To  do  this,  it  is  necessary  to  show  that  the  representation  of  the  slack  variable  contains  at  least 
one  positive  component.  From  expression  (17)  we  see  that  half  of  the  coefficients  in  its 
representation  have  a value  of  fl  and  the  other  half  a value  of  -ff.  Furthermore,  expression 
(16)  guarantees  that  0 does  not  equal  zero.  Hence  it  will  always  be  possible  to  pivot  the  slack 
variable  into  the  basis  and  maintain  primal  feasibility. 


7.  COMPUTATIONAL  RESULTS 

A computer  code  was  developed  for  solving  singly  constrained  transportation  problems  in 
order  to  confirm  the  relative  efficiency  of  the  preceding  algorithms.  The  code  (TRANSC)  uses 
the  threaded  index  method  [12)  for  storing  the  basis  in  graph  form  and  also  uses  the  pro- 
cedures presented  earlier.  The  code  is  written  entirely  in  standard  FORTRAN  IV  and  was  ini- 
I tially  debugged  and  tested  using  the  RUN  compiler  on  a CDC  6600  computer.  Under  these 

conditions  it  occupies  a total  of  6(A/  + iV)  + 3/1  + 6600  words  of  central  memory,  where  M 
and  A/ denote  the  number  of  origins  and  destinations,  respectively,  and  A the  number  of  arcs. 
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Following  the  development  of  TRANSC  a computational  study  was  conducted  to  deter- 
mine good  start  and  pivot  rules  to  use  in  conjunction  with  the  preceding  algorithm  and  to 
evaluate  the  adequacy  of  the  integer  solution  provided  by  pivoting  the  slack  variable  into  the 
basis.  To  conduct  the  testing,  35  feasible  problems  were  randomly  generated.  The  test  results 
indicated  that  the  best  start  rule  was  to  "introduce"  the  extra  constraint  to  the  optimal  basis  for  ! 

the  underlying  transportation  problem  and  to  use  a multipricing  (candidate  list)  pivot  criterion  i 

of  size  8-16.  A complete  description  of  a T-R  candidate  list  follows:  I 

Examine  sequentially  each  of  the  origins  for  the  underlying  transportation 
problem.  For  each  origin,  select  the  nonbasic  arc  that  violates  dual  feasibility 
by  the  largest  amount.  Add  this  arc  (if  one  exists)  to  the  list  and  proceed  to 

the  next  origin.  This  is  done  until  the  list  contains  R entries  or  until  all  of  ; 

the  origins  have  been  explored.  From  the  list  the  "best"  arc  is  selected  to 

enter  the  basis.  The  list  is  accessed  after  each  pivot  until  it  is  void  of  eligible 

edges  or  until  it  has  been  accessed  T times.  At  such  time,  the  list  is  refilled 

by  again  examining  each  origin,  starting  at  the  last  examined  origin.  When 

the  last  origin  is  encountered,  the  dual  constraint  of  the  slack  variable  5 is  I 

checked  before  proceeding  to  the  first  origin.  If  a complete  pass  through  the  i 

origins  is  made  without  filling  the  list,  the  size  of  R is  reset  to  the  number  ' 

found  and  7"  is  reduced  to  1/2/?.  , 

This  type  of  procedure  has  been  found  to  be  exceptionally  efficient  for  pure  and  generalized  t 

network  codes.  For  this  reason,  the  effect  of  the  initial  values  of  T and  R for  the  singly  con-  } 

strained  code  were  tested. 

In  general,  our  testing  indicates  that  a singly  constrained  transportation  problem  requires  I 

approximately  three  times  as  much  time  to  solve  as  the  underlying  transportation  problem.  The 
number  of  nonzero  coefficients  in  the  extra  contraint  affects  the  number  of  times  the  dual  vari-  i 

able  8 changes.  Consequently,  solution  time  tends  to  increase  as  the  number  of  nonzero 

coefficients  increases.  Further,  the  integer  solution  obtained  by  pivoting  the  slack  into  the  ' 

optimal  continuous  basis  never  increased  the  objective  function  value  by  more  than  5%  and 
often  by  only  1%.  Thus,  a good  integer  solution  was  always  obtained.  j 

The  next  phase  of  testing  was  to  compare  the  best  version  of  singly  constrained  code 
TRANSC  with  a widely  used  standard  linear  programming  package,  APEX  III.  APEX  III  is  dis- 
tributed by  Control  Data  Corporation,  and  the  tests  were  conducted  on  a CDC  CYBER-74  com- 
puter. In  this  case,  the  three  problems  indicated  in  Table  1 were  used  as  test  data.  These  prob-  > 

lem  range  from  a 50  x 50  constructed  transportation  problem  to  a 500  x 500  node  constrained  , i" 

transportation  problem.  The  basis  of  comparison,  however,  is  not  CPU  seconds,  but  rather  a f 

CDC  accounting  measure  called  an  SBU.  The  total  SBU  count  for  a job  is  an  accumulation  of  f 

CPU  seconds  used,  I/O  functions  performed,  and  central-memory  requirements.  This  measure  i 

seemed  appropriate,  especially  since  it  allowed  the  comparison  of  actual  dollar  amounts.  CDC  I 

charges  customers  a minimum  of  SO.  18  per  SBU  used. 

The  results  of  the  comparison  of  TRANSC  and  APEX  III  are  contained  in  Table  2.  In 
this  case,  TRANSC  was  from  45.3  times  more  efficient  (on  a small-sized  problem)  to  67.96 
times  more  efficient  (on  a large  scale  problem).  j 
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Table  1.  Problem  Specifications 


1 

Problem 

Origins 

Destinations 

Arcs 

Cost 

Range 

Total 

Supply 

Min 

Max 

I 

50 

50 

500 

H 

100 

100000 

2 

200 

700 

5000 

1 

100 

100000 

3 

500 

500 

5000 

II 

100 

100000 

Extra  Constraint 


Problem  Nonzero  pefficient  Range  Rjght-Hand  Side 


Coefficients 

Min 

Max 

Value 

1 

3000 

-1.5 

1.5 

50000 

2 

4000 

-2.0 

2.0 

20000 

3 

6000 

-I.O 

1.0 

10000 

Table  2.  TRANSC  vs  APEX-III 


1 

Problem 

j TRANSC 

APEX-III 

SBU’s' 

Cost^ 

SBU’s' 

Cost^ 

1 

n 

0.72 



0.13 

32.77 

2 

13.83 

2.49 

925.18 

3 

17.26 

3.10 

1170.42 

System  Billing  Unit 
Computed  at  $0.18  per  SBU 
Terminated  after  10,000  iterations 
Objective  function  value  ~ 2.S  billion 
Optimal  value  - 6 million 
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[2] 

(31 

[41 

(51 

(61 

(71 

(81 

(9 

no: 

[111 

(12! 

[131 

(141 

(151 

(161 

(171 

[18] 

(191 

[201 

(211 
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ABSTRACT 


A generalized-indices  Iransporlation  problem  is  formulaled  and  an  algo- 
rithm is  presented  for  its  solution.  The  algorithm  is  an  extension  of  the  modi- 
method.  A theorem  on  the  number  of  independent  variables  in  the 
generalized-indices  transportation  problem  is  proved.  An  example  problem  is 
solved  for  the  four-indices  transportation  problem.  A computer  program  has 
been  written  to  solve  any  four-indices  problem. 


INTRODUCTION 


The  traditional  transportation  problem  is  one  of  minimizing  the  distribution  costs  between 
a set  of  origins  and  a set  of  destinations.  This  involves  two  indices  on  the  variables  and  the 
costs.  Schell  [10]  was  the  first  to  introduce  the  concept  of  a multidimensional  transportation 
problem.  In  his  example  there  are  various  types  of  products  to  be  distributed  between  a set  of 
origins  and  a set  of  destinations.  Haley  (5]  presented  a solution  procedure  for  the  three-indices 
problem.  In  [6)  he  showed  various  applications  of  the  three-indices  problem.  The  third  index 
can  represent  different  modes  of  transport  between  a set  of  origins  and  destinations,  or  it  may 
represent  a set  of  intermediate  warehouses. 


Theorems  for  the  existence  of  a solution  to  the  three-indices  problem  have  been 
presented  by  Haley  (6,  7,  8),  by  Moravek  and  Vlach  (91,  and  by  Smith  (12).  Corban  [2]  has 
also  solved  the  multi-index  transportation  problem. 


An  example  of  the  four-indices  problem  would  be  the  minimization  of  total  transportation 
costs  between  m factories,  n warehouses,  p wholesale  outlets,  and  q retail  stores.  A typical 
application  of  this  problem  would  be  in  the  oil  industry,  where  oil  has  to  be  transported  from 
refineries  to  intermediate  storage  areas,  to  district  headquarters,  and  finally  to  individual  service 
stations. 


A GENERALIZED-INDICES  TRANSPORTATION  PROBLEM 
Formulation  of  Problem 


We  may  write  this  generalized-indices  transportation  problem  as 


(1) 


f— m k^p  l^q 

Minimize  X XL  "L  / c./* ...  / 

/-I  /-I  *-i  /-I 


J' 
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/-I 

1 - X,,k...h 

V7,  k.  .. 

. . 1. 

i“n 

/-I 

' “ X,,k...h 

V/,  k.  .. 

. . /, 

1 ~ X,i,...i. 

V/,  j.  ... 

. /, 

V/,  ./.  k,  ... 


where  a comma  (,)  in  the  subscript  of  x indicates  a summation  over  that  particular  subscript,  V 
is  an  abbreviation  for  "over  all  combinations  of".  Where 


(3) 

f-n 

Z Xjk...l 

f-l 

. y k . 

..  1. 

1 

L x.ik... 

k^\ 

/-w 

1 - L x,i....i 

f-l 

. y j .. 

. 1. 

1 

L x.ik  .i  - L V 7*  ■ • . 

/-I  /-I 

Constraints  (3)  represent  a set  of  constraints  that  relate  x,*  , with  x,*  /,  x„  , , , and  x„*  . 

Similar  constraints  exist  for  x,  * /,  x,^  / , . . . , and  x„n  with  only  downward  combinations  con- 
sidered so  as  to  eliminate  duplicate  constraints,  and 


(4) 

Solution  Procedure 


>0  , V(,  J,  k.  ...  I. 


The  solution  procedure  is  an  extension  of  the  modi-method  (13,  51.  The  northwest- 
corner  method  is  used  to  get  an  initial  solution. 

Obtaining  an  Initial  Basic  Feasible  Solution 


It  is  shown  in  the  appendix  that  there  are  mnp  ...  q - (w-1)  (n-l)  (p-l)  ...  (^-1) 
independent  variables  in  the  generalized  indices  problem.  Using  the  extended  northwest-corner 
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method  an  initial  solution  is  obtained  with  at  most  mnp  ...  q - (m-1)  («-I)  (p-1)  . . . (^-1) 
nonzero  variables.  Such  a solution  is  called  a basic  solution.  Infeasibility  because  of  some  vari- 
ables being  negative  is  treated  as  in  the  three-indices  problem  by  Haley  [S]  and  is  described 
below. 


Starting  in  the  northwest  corner  Xm  n is  set  equal  to  min(x  n ,,,  X|  , n,  ....  Xm  ,, 
^111  1.)-  If  the  minimum  is  x u ,,  dashes  are  placed  for  variables  X211  ii>  ■’^31111*  -to  sig- 
nify a zero  level  for  these  variables.  Similarly  if  X|  , ,|  is  the  minimum  then  X121.11, 
^131  n-  • ■ • would  be  set  equal  to  zero,  etc. 

Next,  if  xiii  21  has  not  been  set  equal  to  zero  it  is  equated  to  min(x  n 21,  xi  1 21.  • • • , 
■*111  , 1 “ -"f III  ii<  Jf|ii  ,2,)-  Then  Xm  31,  Xm  41,  ...  are  allocated  till  Xm  j is  exhausted. 
Next,  if  Xm  12  has  not  been  set  equal  to  zero  it  is  equated  to  minfx  n 12,  xi  1 12 , ...  , 
■*111  -’flu  I.  ~^iii  ii)-  This  is  repeated  till  Xm  1 is  exhausted.  This  process  is  repeated 
till  all  the  variables  have  been  allocated. 


Infeasibility 


The  procedure  for  generating  an  initial  solution  may  lead  to  some  basic  variables  being 
negative.  To  remove  this  infeasibility  an  unknown  quantity  4>  is  added  to  the  basic  variable 
which  is  at  a negative  level.  Integer  multiples  {r,j^  f)  of  <t>  are  then  added  to  basic  or  nonbasic 
variables  and  subtracted  from  basic  variables  according  to  equation  (5), 


(5) 


£ ^/*.../  “ 0'  X '■y*  ■' 

/-I  i-\ 


>^-P 

0-  L '•-/A.../  - 0, 

A-l 


X '■->A.../ 
/-I 


- 0. 


This  ensures  complicance  with  the  feasibility  constraints  of  equation  (2). 


We  then  set  <t>  equal  to  min(x„*  ///■,;*  ,)  for  those  basic  variables  (occupied  cells)  which 
have  I <b  subtracted  from  them.  This  ensures  that  no  new  basic  variable  becomes  negative 
and  makes  the  largest  change  to  the  variable  at  the  negative  level.  We  then  add  , <6  to  or 
subtract  it  from  basic  or  nonbasic  variables  to  obtain  the  next  tableau. 


This  process  is  repeated  till  all  basic  variables  reach  a positive  level  (each  variable  at  a 
negative  level  is  treated  in  turn) . 


An  example  infeasible  solution  for  a four-indices  problem  is  shown  in  Tables  1 and  2. 
The  variable  Xim  is  at  a negative  level  of  -1.  In  order  to  resolve  infeasibility,  0 is  added  to 
Xiiii.  Since  0 has  been  added  to  the  first  row,  we  must  also  subtract  the  same  amount  from 
one  of  the  basic  variables  in  the  first  row;  thus  <f>  is  subtracted  from  Xnii.  This  is  continued  till 
equation  (S)  is  satisfied  for  each  /,  j.  k /. 

Next  the  value  of  0 is  determined  to  be  the  minimum  of  (4/1,  2/1,  6/1,  6/1,  1/1,  1/1, 
1/1,  2/1).  This  sets  0 at  1.  Using  this  value  of  0 we  transform  to  obtain  the  tableau  for  the 
next  iteration.  In  case  of  degeneracy  the  newly  feasible  variable  (in  this  case  Xmi)  is  kept  at  a 
zero  level  and  the  other  basic  variables  which  went  to  zero  (in  this  case  X11112,  X1222.  X2212)  are 
set  equal  to  an  infinitesimally  small  quantity  c. 

Evaluating  Dual  Costs 

For  the  N-indices  problem  each  occupied  cell  has  N dual  costs  associated  with  it, 
<r>A  />  ''ik  h w,y  f.  ....  . The  sum  of  these  dual  costs  equals  the  cost  of  the  variable. 

Thus 


I 
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TABLE  1.  Example  I.  Resolution  of  infeasihility  in 
four-indices  problem. 

Iteration  I.  Fourth  subscript  I = I 


7 = 1 

7 = 1 

7=  2 

7 = 2 

i=l 

-1  + (b 

* = 1 

4-  (b 

A'  = l 

allocation 

(=1 

2 - <b 

k=2 

2 + (b 

A =2 

/=2 

6 — (b 

k=l 

I + <b 

k=\ 

/=2 

0 + <b 

k=2 

6 — <b 

A=2 

TABLE  2.  Example  I.  Resolution  of  infeasibiUty  in 
four-indices  problem. 

Iteration  I.  Fourth  subscript  I = 2 


7=  1 

7=  1 

7 = 2 

7 = 2 

/=1 

I -<b 

A = 1 

6 + (b 

A = 1 

/=! 

1 + 

k=2 

1 - <b 

k=2 

i=2 

4 + <b 

A=1 

1 -<b 

A=1 

/=2 

1 II 

6 + <f) 

A=2 

(6)  + V,*  , + w,f  I + . . . + y,n  = C,y*.  ^ £ 

I 

i 

There  are  a total  of  mnp...  + mn...q  + mp...q  + ...  + np...q  dual  costs  (mnp...  dual  costs 
of  the  type  y,y*.  , etc.).  The  number  of  occupied  cells  (basic  variables)  has  been  shown  to  be  ' 

mnp...q  - {m-\)  (n-\)  (p-\)  ...  (q-\).  Thus,  the  number  of  dual  costs  to  be  set  equal  to  * 

zero  is  mnp...  + mn...q  + mp...q  + ...  + np...q  - lmnp...q  - (m-l)  (n-1)  (p-1)  ...  (</-!)].  . •* 

The  set  of  dual  costs  to  be  set  equal  to  zero  is  chosen  such  that  equation  (6)  is  satisfied  for 
each  of  the  basic  variables. 

Evaluating  Unoccupied  Cells 

I For  each  unoccupied  cell  the  quantity  , is  calculated  as  the  difference  between  the 

cost  of  the  variable  and  the  sum  of  dual  costs: 
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‘ilik.  l “ . I + P,* + •P,;.,./  + . . . + y,ik  , ). 

If  some  </„*  / are  negative,  then  the  variable  with  the  most  negative  / is  introduced  into  the 
basis.  If  all  </„*  , are  nonnegative  then  an  optimal  solution  has  been  obtained.  If  in  addition 
some  q,iu  are  zero,  then  multiple  optima  exist,  and  these  can  be  obtained  by  entering  the 
relevant  variables  into  the  basis. 

Altering  a Basic  Feasible  Solution 


An  unknown  quantity  9 is  added  to  the  entering  variable.  Integer  multiples  (//„*  ,)  of  9 
are  then  added  to  or  subtracted  from  the  occupied  cells  according  to  equation  (8)  to  maintain 
feasibility  and  to  continue  to  satisfy  equation  (2): 


(8) 


1^'  l-n  k-p  l-q 

£ ^ilk...l  “ 0>  ^ ^/;*.  / “0.  ^ ^ilk...l  “0,  ....  ^ tliih  I = 0. 

i-l  1-1  *-l  /-I 


The  altering  of  a basic  feasible  solution  with  the  use  of  multiples  of  9 proceeds  in  a manner 
similar  to  removing  infeasibilities  according  to  equation  (5). 


Obtaining  a New  Basic  Feasible  Solution 


We  set  9 equal  to  min (x, for  those  occupied  cells  which  have  , 9 subtracted 
from  them.  This  ensures  that  all  basic  variables  remain  nonnegative  (equation  (4)]  while  mak- 
ing the  largest  possible  reduction  in  the  objective  function.  We  then  add  ,,  9 to  or  subtract 
it  from  basic  variables  to  obtain  the  next  tableau. 

These  steps  are  repeated  till  an  optimal  tableau  is  obtained  with  all  j nonnegative. 
The  validity  of  this  procedure  follows  from  a extension  of  the  theorems  proved  by  Haley  (6). 

EXAMPLE  PROBLEM 

As  an  application  of  this  solution  procedure  an  example  problem  is  solved  which  involves 
four  indices. 

Formulation 

We  can  write  this  example  problem  as 

1-3  1-2  k-2  1-2 

minimize  I I £ £ ^oki 

/-I  /-I  *-i  /-I 


‘Zi 

£•*^1111  ” 13,  ^X,|2|  “ 7,  ^X,2||  *“  15,  5^  x,22|  “ 15, 

<-l  (-1  I-l  1-1 

,-}  ,.} 

1^-  £•’^(122”  ^•’^<212“  S -^(222  “ I®- 

'-I  /-I  i-l  ,-| 

;-2 

2<-’f|/2l  “ 3,  ^X2,||  — II,  ^X2,2|  “ 10, 

/-I  /-I  ;-l  /-I 

Xr -*^3(11  “ 10,  X'*3/21“9,  X-*^I/I2“9>  X'*^I/22“9, 

/-I  -I  I-l  I-l 


(9) 

subject  to 

(10) 


I 

I 


I 


t - 


I 
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21-’^3;12“9'  i'’C3,22“lS, 

,-l  /-I  /-I  /-I 

k-1  *-2  *-2  *-2 

^■’f|n|“S,  ]^->Ci2A|"5,  ^X2ui  = 9,  Xx22*i“12, 

k-1  k-1  k-1  k-l 

k-2  k-2  *-2  k^ 

51-*3lkl“6'  £3r32kl  = 13.  ^->f|n2“6. 

k-1  k-1  k-1  k-1 

X •*’21k2  “ X-’^22k2“ll'  X ■’‘^31k2  “ ^0-  X •’^32k2  “ 14, 

k-1  k-1  k-1  k-1 

/-2  1-2  1-2  1^ 

X-’^iii/“5,  X3Cii2/=“6, 

1-1  /-I  1-1  /-I 

X'*21W“14,  ^X221/“12,  ^•>C212/“9,  ^3(’2221  “11* 

1-1  1-1  1-1  l-l 

£-<■3111  “8,  ]^3f32ll“ll'  ^3^312/”  8,  X -’‘'3221  ” 16, 

1-1  1-1  1-1  1-1 

where  x x,  */,  x,,  ;,  and  x,;*  satisfy  the  following  equalities; 


1-2 

Z-’^./ii  “ 
/«] 

7-3 

X-^'.ii’ 

l-\ 

1-2 

Z ^./21  = 
/-) 

X -*^'.21' 

7-1 

1-2 

(-3 

1-2 

7-3 

Z^r.iu- 

/•I 

X 1 1* 

l-\ 

Z -'3,122  = 

/-I 

X''^'.22' 

7-1 

fc-2 

,-y 

A-2 

“ X-^f2.I* 

X I A 1 

X^7M. 

' Z-*'.2kl 

A-1 

7-1 

k-1 

7-1 

k-2 

/-3 

k-2 

7-3 

X u 2 ““ 

X '^/I.2‘ 

■ Z ^.2k2 

“ X-’^/2,2^ 

A-I 

7-1 

k-l 

7-1 

/-2 

7-3 

1-2 

7-3 

X -^.11/  * 

/.l 

X^^ii' 

7-1 

Z 121  “ 
1-1 

X -*^'12' 

7-1 

1-2 

Z-’‘,21. 

1-1 

1-2 

7-3 

X -^.21/  * 

l-\ 

Z -’‘.221  “ 
1-1 

X-^'22» 

7-1 

k-2 

f-2 

k-2 

/-2 

X “ 

L^fi/.i 

. Z3‘2.kl 

- Z^fj^.i, 

k-l 

1-1 

k-l 

1-1 

k-2 

1-2 

k-2 

i-2 

Z Jf3.kl  - Z3f3i. 

!■  Z ■’‘l.k2 

“ Z-’‘ll.2' 

A-l 

/-I 

k-1 

1-1 

k-2 

1-2 

k-2 

1-2 

Z -*’2. *2  ■ 

•! 

■ L3f2,,; 
/-I 

2.  Z-*'3.k2 

A-l 

“ Z -*3/, 2^ 

i-\ 

1-2 

/-2 

1-2 

/-2 

Z'*I.W  “ 
1-1 

Z^fi/i. 

' Z-’‘l.21 
1-1 

- Z3fl,2,. 
/-> 

1-2 

/-2 

1-2 

i-2 

Z'’‘2,ll  “ 
/•; 

Z ■*2/1. 
/.) 

- Z-’‘'2.21 
1-1 

“ Z-’‘2i2.> 

/-I 

1-2 

i-2 

1-2 

/-2 

Z-*'3,ll  ” 

/-I 

Z-»‘3,l, 

1-1 

> Z-’‘3,21 
1-1 

“ Z-’‘3,2.- 
1-1 
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1-2  k-2  1-2  k-2 

1-1  k-\  1-1  k-l 

1-2  k-2  1-2  k-2 

£•’‘'21./='  £ '<'2U,>  £■’‘'22,;='  £•<'22/1,- 
/-I  *-l  1-1  k-\ 

1-2  k-2  1-2  k-2 

£-’‘'31,/='  £■’‘31*.-  £■’‘'32,/=  £-<'32*,- 

1-1  *-l  /-I  k-l 


^i/kl  ^ 0 


Vij.k.L 


Solution 

Following  the  procedure  outlined  above,  an  initial  basic  feasible  solution  to  the  example 
problem  is  presented  in  Tables  3,  4,  and  5.  The  variables  are  allocated  according  to  equation 
(10)  and  the  end  requirements  satisfy  equation  (11).  The  cost  of  allocations  in  the  initial  basic 
feasible  solution  is  given  by  equation  (9)  to  be  441. 

TABLES.  Example  problem.  Initial  basic  feasible  solution. 

Fourth  subscript  1=1.  Note  m = 3,  n = 2,  p = 2,  a = 2. 


*=1 

/ 

2 

T^ 

1 

1 

A:-l 

3 

1 

k-\ 

10 

15 
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TABLE  4.  Example  problem.  Jnitial  basic  feasible  solution. 
Fourth  subscript  1=2. 


14 

19 

x./u 

16 

18 

TABLES.  Example  problem. 
Initial  basic  feasible  solution. 
Summation  over  all  values  of  fourth 
subscript  I = 1,2 
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For  this  problem  m = 3,  n = 2,  p = 2,  and  q = 2.  Thus  there  are  (ninpq=)  24  variables 
and  \mnpq  — (w  — 1)  O/  — I)  (/>  — !)  (</  — 1)=-]  22  of  these  are  in  the  basis  at  a given  time.  There 
are  a total  of  (mnp  -I-  nitiq  -I-  mpq  + npq  ■=)  44  dual  costs  and  since  there  are  a total  of  22 
independent  variables,  (44-22  = ) 22  of  the  dual  costs  may  be  .set  equal  to  zero.  The  set  of  dual 
costs  to  be  set  equal  to  zero  is  chosen  such  that  equation  (6)  is  satisfied  for  each  of  the  basic 
variables. 

Work  sheets  for  proceeding  from  one  iteration  to  the  next  are  shown  in  Tables  6 and  7. 
The  work  sheets  display  the  values  of  the  basic  variables  x;,*,,  cost  of  each  variable  t;,*,,  dual 
costs  M,*,,  v,^/,  w,,,,  and  , as  well  as  (the  difference  between  cost  of  the  nonbasic  variable 
and  the  sum  of  dual  costs).  Of  the  two  nonbasic  variables  Xjm  and  X||i2  only  the  latter  has 
q,ii^i  negative.  Hence  xmj  enters  the  basis  in  the  next  iteration. 

An  unknown  quantity  9 is  added  to  x,ii2  and  9 is  added  to  and  subtracted  from  some 
of  the  occupied  cells  according  to  equation  (8).  Then  the  value  of  9 is  determined  from  those 
cells  which  have  a multiple  of  9 subtracted  from  them; 

9 = min(x„*, /«„*,) 

= min  (5/1,  3/1,  3/1,  1/1,  9/1,  6/1,  6/1,  2/1) 

= 1 

TABLE  6.  Example  Problem. 

Iteration  1. 

Fourth  subscript  / = / 


y - 1 

U|i|  - 0 

y - 1 

U|2|  - 0 

y = 2 

W211  = 0 

./=  2 

W221  = 0 

•*‘'1111  “ 5 — 9 

■’*1211-2-1-0 

/-  1 

Will  - 3 

w,2i  = 2 

V,,,  - 0 

-Pill  - 3 

>121  “ 4 

Clin  “ 6 

C1211  ” 6 

■*^1121  “ < + 9 

■**  1221  = 3-0 

/ = 1 

•Pill  “ 3 

**'121  “ 2 

*'121  - 0 

>112  ” 5 

>122  - 3 

C1121  “ 8 

c 1221  “ 5 

•*■2111  “ 8 -1-  fl 

•’*2211  “3  — 0 

/ “ 2 

**'211  - 1 

W221  = 10 

*'211  “ 0 

.P211  “ 3 

>221 7 

C2U1  - 4 

C2211  ” 3 

•*^2121  “1—0 

■’*2221  = 9-1-0 

/ - 2 

**’211  ” 1 

**’221  “ 10 

V221  “ 0 

.P212  - 1 

>222 6 

C2121  “ 2 

C2221  “ 4 

^3,11  - 10 

■’*3211  “ 10 

/ - 3 

wjii  - 0 

**’321  “ 1 

*'311  ” 

.V311  - 7 

>322  ” 8 

C3111  - 9 

C32II  - 1 

•’*3121  “ 6 

■**3221  “ 3 

i — 3 

**'311  - 0 

**'321  - 1 

0 

1 

>312  - 6 

>322  - 1 

C3121  " 6 

CM 

1 

•“S 

; 
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TABLE  7.  Example  Problem. 
Iteration  I. 

Fourth  subscript  I — 2 


y - 1 
«112  * 0 

7 - 1 
"122-0 

y “ 2 
"212  - 0 

y - 2 

"222  — 0 

‘11112  “ -1 

•’^1212  =9-0 

i - 1 

•*'112  = 0 

**'122  “ 0 

V,,2  = 0 

>'111  ” 3 

>121  “ 4 

C1112-  2 

. 

C 1-)12  = 4 

1122  ” 6 — 

^ 1222  = 3 + 0 

/=  1 

K'112  ” 0 

**’122  “ 0 

0 

II 

.V|(2  = 5 

>122  “ 3 

C1122  = 5 

f 1222  = 3 

-’*'2112  = 6 — 

-*■2212  “ 9 d 

/ “ 2 

**'212  = 7 

*♦'222  — 16 

•'212  “ ”8 

>"211  = 3 

>221 7 

‘’2112  “ 2 

^2212  = 1 

•*^■2122  “ 8 -I-  ^ 

■*^2222  = 2—0 

i ” 2 

**'212  = 7 

H'222  = 16 

*'222  “ 0 

>■212  = I 

>222  “ -6 

C2122  “ 8 

*"2222  = 10 

■**■3112  = 8 

•*■3212  = 1 

i - 3 

**'312 3 

**'322  — 1 

1/3,2  - 0 

>"311  “ 7 

>321  “ 8 

C3112  ” 4 

C3212  = 9 

■’^3122  = 2 

•*^3222  = 1 3 

i “ 3 

**'312  = -3 

**'322  “ 1 

*'322  - 0 

>'312  “ 6 

>322  - 1 

^3122  = 3 

C3222  “ 2 

. Next  is  added  to  or  subtracted  from  basic  variables  to  yield  Tables  8 and  9 of  itera- 

tion 2.  The  dual  costs  are  evaluated  as  before.  Since  the  associated  with  both  the  nonbasic 
variables  X2121  and  X3111  are  nonnegative,  an  optimal  solution  has  been  obtained.  Since  both 
the  q,ni  are  positive,  we  have  a unique  optimal  point.  The  minimum  cost  is  440. 

A computer  program  has  been  written  to  solve  any  four-indices  transportation  problem. 

Interpretation  of  End  Requirements 

For  a four-indices  transportation  problem  involving  minimization  of  transportation  costs 
between  factory  /,  warehouse  7,  wholesale  outlet  k,  and  retail  store  /,  the  following  definitions 
are  valid  for  the  end  requirements  x x,  /,,,  x,,  ,,  and  x,fn : 

X - amount  shipped  from  warehouse  j via  wholesaler  k to  retailer  4 

AT,  — amount  shipped  from  factory  ; via  wholesaler  k to  retailer  /, 

Xiij  — amount  shipped  from  factory  i via  warehouse  j to  retailer  /, 

x,,*  — amount  shipped  from  factory  i via  warehouse  j to  wholesaler  k. 

( These  end  requirements  can  be  determined  for  any  practical  problem  from  the  total  sup- 


ply at  factory  /,  inventory  capacity  of  warehouse  y,  inventory  capacity  of  wholesaler  k,  total 
requirement  at  retailer  /,  and  equation  (3).  < 
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TABLE  9.  Example  Problem. 
Iteration  2. 

Fourth  subscript  1 — 2 


y - I 

M|,2  “ 0 

y - 1 

Mi22  ” 0 

y - 2 
"212  “ 0 

y'-  2 
“222  “ 0 

1112  “ 1 

■’‘  1212  “ 8 

/ - 1 

**'112  “ -4 

**’122  “ ~2 

v,,2  - 0 

>'111  ” 6 

>-121  “ 6 

fiiu  " 2 

‘'  1212  * 4 

■’‘1122  “ 5 

■*‘1222  “ 4 

/ - 1 

**'112  ” -4 

**'122 2 

‘*122  “ 2 

>'112  “ 2 

>'122  “ 3 

‘'1122  “ 5 

‘'1222  * 3 

■’*'2112  “ 5 

■’‘2212  “ 10 

; - 2 

**’212  “ 0 

**'222  “ 

*'212  “ 1 

>'211  “ 1 

>'221  - 1 

‘'2112  “ 2 

‘'2212  * 1 

■’‘2122  “ 9 

■’‘2222  “ 1 

/ - 2 

**'212  = 0 

**'222  * -1 

1-222  “ 10 

>'212  “ -2 

>'222  * 1 

‘'2122  “ 8 

‘'2222  “ 10 

•’*'3112  ” 8 

■’‘3212  “ 1 

/ - 3 

**’312 4 

**'322  “0 

V3,2  - 9 

>'311  ” ~1 

>'321  - 0 

‘'3122  ” 4 

‘‘3212  “ 9 

■’‘3122  “ 2 

’*3222  “ 13 

/ » 3 

**'312 4 

**'322  = 0 

*'322  “ 0 

>'312  “ 2 

>'322  “ 2 

‘'3122  ” 3 

‘'3222  “ 2 

For  another  four-indices  transportation  problem  involving  minimization  of  transportation 
costs  between  factory  / and  retail  store  / for  product  j and  by  mode  of  transportation  k,  the  fol- 
lowing definitions  would  apply  for  x,  ^i,  x,,  /,  and  x,^  : 

X — amount  shipped  of  product  j by  mode  k to  retailer  /, 

X,  — amount  shipped  from  factory  / by  mode  k to  retailer  /, 

x,i  i - amount  shipped  from  factory  / by  product  j to  retailer  /, 

x,,*  — amount  shipped  from  factory  / of  product  j by  mode  k. 


APPENDIX 

For  the  rwo-indices  transportation  problem  Hadley  l4],  Dantzig  [3],  and  Simonnard  and 
Hadley  (11)  have  all  shown  that  the  number  of  independent  variables  for  a problem  with  m 
sources  and  n destinations  is  (m  + n — 1).  This  is  equivalent  to  ll  + (/w  — 1)  + (n  — 1)) 
independent  variables. 

Schell  (lOl  and  Bammi  (ll  have  shown  that  the  number  of  independent  variables  in  a 
r/?rpe-indices  transportation  problem  with  m factories,  n warehouses,  and  p types  of  products  or 
p retail  stores  is  {mnp  - (m-1)  (w-l)  (p-D).  This  is  equivalent  to  (1  -t-  (m-l)  + (n-1)  4- 
(p-1)  + (m-l)  (n-1)  (m-l)  (p-1)  + (n-1)  (p-l)l  independent  variables. 
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The  number  of  independent  variables  in  a four-indices  problem  with  m factories,  n 
warehouses,  p wholesale  outlets,  and  q retail  stores  is  given  by  [mnpq  — (m  — 1)  (n  — 1)  (p—1) 
{(/-1)1,  Changing  to  a general  notation  and  replacing  m by  m ,,  n by  m2,  p by  m3,  and  q by  m 4 
it  can  be  stated  that  the  number  of  independent  variables  in  the  four-indices  problem  equals 


one  plus 


plus 


variables  of  type  (m,  - 1),  plus 


variables  of  type  (m,  - 1)  (m,  -1),  J > /, 
variables  of  type  (m,  - 1)  (m^  - 1)  (m*  - 1)  (m*  - 1),  j > i,  k > j. 


THEOREM:  The  number  of  independent  variables  in  the  N-indices  transportation  problem  is 

one  plus  the  sum  of  the  products  over  all  combinations  of  (m,  - I)  taken  1,2 (N-I)  at  a time 

from  N. 

Proof  of  the  theorem  is  by  induction.  The  (N  -I-  l)-indices  theorem  is  shown  to  be  true 
if  the  ^indices  theorem  holds,  then  the  ^indices  theorem  is  true  if  the  (A'-l) -indices  theorem 
holds,  and  so  on.  The  four-indices  theorem  is  true  if  the  three-indices  theorem  holds.  The 
three-  and  two-indices  theorems  have  already  been  shown  to  be  true.  Therefore  the  four-, 
five-,  ...  , (A/-1)-,  and  A'-indices  theorems  are  also  true. 

PROOF:  It  will  be  shown  that  the  (A^-E  l)-indices  theorem  is  true  if  the  ^indices 
theorem  holds. 


There  are  m\  nti  m^ ...  m^y  m^+i  variables  in  the  (A/-I- 1) -indices  problem.  All  these  vari- 
ables are  contained  in  a hypercuboid  of  {N+\)  sides  the  dimension  of  whose  sides  are 

m,,  m2,  m3 m^v,  m/v+i  (hereafter  called  the  first  hypercuboid).  Consider  the  next 

smaller  hypercuboid  of  A-l-l  sides  having  dimensions  m|  - 1,  m2  - 1,  m3  - 1,  ...  , m,v  - 1. 
m,v+i  - 1 (hereafter  called  the  second  hypercuboid)  and  located  at  one  of  the  vertices  of  the 
first  hypercuboid.  Next  consider  the  A^  -E  1 hypercuboids  of  A -E  1 sides  and  dimensions 

mj  - 1,  mj  - 1,  m3  - 1 m,^i  - 1,  l,m,+|  - 1,  ....  m^/  - 1,  m,v+i  ~ 1 (referred  to  as 

the  third  type  of  hypercuboid).  The  variables  in  these  N -f  1 hypercuboids  of  the  third  iypt  are 
dependent  on  the  variables  in  the  second  hypercuboid.  To  prove  this  linear  dependence,  the 
A-indices  theorem  is  applied  to  a multidimensional  vector  in  one  of  /V  -E  1 hypercuboids  of  the 
third  type  and  the  corresponding  set  of  m,  - 1 vectors  in  the  second  hypercuboid.  The  N- 
indices  theorem  is  applied  in  this  fashion  to  each  of  the  m,  - 1 vectors  in  each  of  the  A'  -E  I 
hypercuboids  of  the  third  lypt. 


The  above  application  of  the  Al-indices  theorem  indicates  that  the  number  of  independent 
variables  in  the  secowi/ hypercuboid  is 
v + i 

(m,  - 1)  (m2-  1)  (m3-  1)  ...  (m,_,  - 1)  1 («,.,,  - 1)  ...  (m^+,  - 1). 

where  there  is  one  such  term  for  each  of  the  A -E  1 faces  of  the  second  hypercuboid. 

Consider  next  the  /V  -E  1 hypercuboids  of  A -E  1 sides  and  dimensions  m,  - 1,  mj  - 1, 

m3  — 1,  ...  , m,_2—  1,  1,  1,  m,  + | — 1 m^  - 1,  m/y+i  — 1 (referred  to  as  the  fourth  type 

of  hypercuboid).  By  application  of  the  (N  - l)-indices  theorem  it  can  be  shown  that  the 
number  of  independent  variables  in  all  hypercuboids  of  the  fourth  type  is 

N N 

L - 1)  (m2-  1)  (m3-  1)  ...  (m,.2-  Dll'  (m,^,  - 1)  ...  (m^,,  - 1). 

I-l-l  t-1 
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In  this  manner  the  N-,  (.N  — 1)-,  (N  — 2)- 3-,  and  2-indices  theorems  are  applied  to 

get  the  total  number  of  independent  variables  in  the  (N  -i-  l)-indices  problem.  Thus,  the 

(N  + I)-indices  theorem  stands  proved  if  the  N-indices  theorem  is  true.  i 

Since  the  2-  and  3-indices  theorems  are  true,  the  higher-order  theorems  are  also  true. 
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ABSTRACT 

Suppose  we  are  given  a network  G-(V.E)  with  arc  distances  and  a linear 
cost  function  for  lengthening  arcs.  In  this  note,  we  consider  a network- 
interdiction  problem  in  which  the  shortest  path  from  source  node  s to  sink 
node  I is  to  be  increased  to  at  least  t units  via  a least-cost  investment  strategy. 
This  problem  is  shown  to  reduce  to  a simple  minimum-cost-flow  problem.  Ap- 
plications and  generalizations  are  discussed,  including  the  multiple-destination 
case. 


Let  C-(  y.E)  be  a directed  graph  with  vertex  set  V,  arc  set  E,  arc  distances  dix,y),  and 
per-unit  costs  c(x.y)  for  increasing  the  length  or  travel  discomfort  of  arc  (x.y)  . That  is,  the 
cost  of  increasing  the  length  of  arc  (x.y)  by  8(jr,y)  is  given  by  c(x,y)6U,y).  Nodes  sand  fare 
distinguished  as  source  and  sink  nodes,  respectively.  In  a recent  paper,  Fulkerson  and  Harding 
(41  consider  the  problem  of  maximizing  the  length  of  the  shortest  path  from  sto  / given  a fixed 
budget  b.  They  demonstrate  that  the  computation  required  is  equivalent  to  the  parametric  solu- 
tion of  a minimum-cost-flow  problem.  In  this  note,  we  focus  on  a closely  related  problem  in 
which  the  length  of  the  shortest  path  from  node  s to  node  / is  to  be  increased  to  at  least  t units 
via  a least-cost  investment  strategy.  This  problem  is  shown  to  reduce  to  a simple  minimum- 
cost-flow  problem. 

An  important  application  of  this  problem,  as  mentioned  by  Fulkerson  and  Harding  [4], 
concerns  an  opponent’s  supply  of  information,  manpower,  or  equipment  from  a source  j€  Fto 
a destination  / € V.  Increasing  the  length  of  the  shortest  path  to  at  least  r units  ensures  a 
predetermined  level  of  difficulty  for  the  opponent.  Equivalently,  the  problem  can  be  viewed  as 
one  concerning  network  reliability.  If  each  arc  (x,y)  has  a probability  p{x,y)  of  being  opera- 
tive, then  the  interdiction  problem  becomes  one  of  guaranteeing  at  minimal  cost  that  the  most 
reliable  path  from  s to  / have  no  more  than  a prespecified  chance  of  being  operative.  This 
equivalence  becomes  apparent  upon  taking  the  logarithm  of  the  product  of  the  arc  reliabilities. 

The  network  interdiction  problem  is  formulated  as  follows: 

min  ^ c(x,y)  6(x,y), 

(1-2)  7r(f)  — ir(j)  > T, 

^•■3)  7r(y)  - n(x)  - 8(x,y)  < dCx.y),  (x,y) € £. 

(14)  8(x,y)  ^ 0. 
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The  decision  variable  7r(*v)  is  a node  potential  which  represents  the  length  of  the  shortest  path 
from  node  s to  node  w when  7r(s)=“0;  the  decision  variable  indicates  the  nonnegative 

increase  in  distance  on  arc  Equations  (1.3)  guarantee  that  the  node  potentials  reflect 

shortest  path  lengths.  Equation  (1.2)  constrains  the  shortest  path  from  s to  t to  be  no  shorter 
than  T units. 


If  we  assign  dual  variables  z and  fix.y)  to  equations  (1.2)  and  (13)  respectively,  the 
linear  programming  dual  of  (1.1)  - (1.4)  becomes 


(2.1) 

(2.2) 

(2.3) 


max  Tz—  ^ fix.y)  dix.y) 


S fix,J)  - X fim.x) 


z for  X — V. 

0 for  X 6 f'-li.r I. 
— z for  X - I, 


0 ^ fix.y)  ^ f ix.y), 


where 

■4  (x)  - l^e  Kl(x,y)e£) 
and 


Bix)  - [m€  y\(m.x)^E\. 


Note  that  for  sufficiently  large  values  of  t,  where  tz  dominates  the  objective  function,  the 
above  program  seeks  a maximal  (low  solution  that  minimizes  the  summation  in  (2.1).  The  pro- 
gram imposes  a premium  of  t units  on  every  unit  of  flow  that  reaches  node  /,  and  a penalty 
dix.y)  on  every  unit  of  flow  on  arc  ix,y).  If  we  view  the  objective  function  (2.1)'  as  an 
equivalent  minimization  model,  we  have  a minimum-cost-flow  problem  where  zcan  be  thought 
of  as  the  flow  on  the  "return  arc"  from  r to  s. 

The  relationship  between  our  network-interdiction  problem  and  the  maximum  minimum- 
path  problem  iMMPP)  treated  by  Fulkerson  and  Harding  is  rather  straightforward.  An  optimal 
solution  (it*, 8*)  to  problem  (1.1)  - (1.4)  is  a feasible  solution  to  the  MMPP\i 

^ cix.y)  h*ix.y)  < b. 

In  this  case,  when  t is  increased  until  the  minimum  cost  solution  is  equal  to  b,  we  have  solved 
the  MMPP.  If  (it  *,8*)  is  not  budget  feasible,  then  decreasing  t will  solve  the  MMPP. 

Now  suppose  the  interdiction  task  requires  that  shortest  paths  from  s to  nodes  Kl), 
f(2).  , tik)  be  no  less  in  length  than  t(1),  t(2) rik)  units  respectively.  This  multi- 

ple destination  problem  can  be  written 

(3.1)  min  2 ‘‘(x^)  8(x,>’), 

(x.yXf 

w(/(/))  - 7r(.v)  > t(/),/  “ 1,2 k. 

niy)  - irix)  - 8(x,j')  < dix.y),  ix.y)  € E, 

8(x.j')  > 0. 


(3.2) 

(3.3) 

(3.4) 
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The  dual  problem,  given  below,  is  again  a minimum-cost-llow  problem. 


(4,1) 


max 


^T(/)r(/)-  ^ Jix.y)  dix.y) 

'-I  («.>)  t h 


(^■2)  ^ /(xj)  — ^ f(m,x)  — 

/♦<(<>  mtH(x) 


^z(i)  for  X — S, 

.-I 

0 for  xf^  - (.s,f(l),  , i{k)\, 

—zii)  for  X - /(/), 


<'»  3)  0 < /(x.y)  ^ c(x.y). 

with  Aix)  and  B(x)  defined  as  before. 


The  minimum-tost  network  flow  problem  has  been  the  focus  of  much  research  attention 
in  recent  years.  For  example,  Barr  et  al.  (2),  Hatch  |5l,  and  Aashtiani  and  Magnanti  (I)  use 
special  data  structures  to  reduce  the  computation  time  of  the  out-of-kilter  primal-dual  algorithm 
by  an  order  of  magnitude.  Problems  with  thousands  of  nodes  and  arcs  are  now  solved  in  a 
matter  of  seconds.  Researchers  at  the  Center  for  Cybernetic  Studies,  University  of  Texas,  have 
also  constructed  very  efficient  special-purpose  primal  simplex  codes  for  solving  these  problems. 
Since  the  network  interdiction  problem  can  be  viewed  as  a minimum-cost-llow  problem,  it  can 
be  solved  easily  and  quickly. 
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ABSTRACT 

Adequate  prediction  of  a response  variable  using  a multiple  linear  regres- 
sion model  is  shown  in  this  article  to  be  related  to  the  presence  of  multicol- 
linearities  among  the  predictor  variables.  If  strong  multicollinearities  are 
present  in  the  data,  this  information  can  be  used  to  determine  when  prediction 
is  likely  to  be  accurate.  A region  of  prediction,  R.  is  proposed  as  a guide  for 
prediction  purposes.  This  region  is  related  to  a prediction  interval  when  the 
matrix  of  predictor  variables  is  of  full  column  rank,  but  it  can  also  be  used 
when  the  sample  is  undersized.  The  Gorman-Toman  ten-variable  data  is  used 
to  illustrate  the  effectiveness  of  the  region  R. 


1.  INTRODUCTION 

Prediction  of  future  observations  is  one  of  the  primary  uses  of  an  estimated  linear  regres- 
sion model.  Although  a large  number  of  papers  and  books  have  been  written  on  the  analysis  of 
regression  data,  the  emphasis  in  the  literature  is  heavily  weighted  toward  problems  of  model 
building  and  estimation  of  model  parameters,  and  not  on  recommendations  for  using  prediction 
equations.  While  these  problems  are  all  related,  they  do  not  necessarily  place  the  same 
demands  on  the  estimated  model. 

Currently  much  of  the  statistical  literature  on  linear  regression  is  focussing  on  properties 
of  biased  regression  estimators.  Notable  articles  include  those  of  James  and  Stein  [10],  Hoerl 
and  Kennard  [9],  Marquardt  [14],  Lindley  and  Smith  112],  Hawkins  [7],  and  Webster,  Gunst, 
and  Mason  [23].  Biased  estimation  is  receiving  such  prominence  due  to  the  realization  that 
multicollinearity  among  the  predictor  variables  (defined  in  section  2)  tends  to  severely  distort 
the  least-squares  estimates  of  the  regression  parameters.  This  in  turn  can  result  in  poor  predic-  j 

tion  of  future  responses.  Subset  selection  procedures  likewise  are  not  immune  to  distortion  in 
the  presence  of  multicollinear  data. 

Underlying  this  need  for  good  parameter  estimates  is  the  assumption  that  the  fitted  model 
is  to  be  used  to  predict  over  a wide  region  of  interest  of  the  predictor  variables,  perhaps  an 
entire  rectangular  region  defined  by  the  extreme  values  observed  on  each  predictor  variable. 

*Thix  research  was  sponsored  in  pari  by  ihe  Air  Force  Office  of  Scientific  Research.  Air  Force  Systems  Command, 

USAF,  under  Grant  No  AFO.SR-75-2871 
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This  may  be  an  unduly  stringent  assumption  as  Hocking  (81  discusses  from  a variable  selection 
viewpoint.  In  other  words,  frequently  it  is  not  ncessary  to  predict  over  such  a wide  region 
When  this  is  so,  accurate  predictions  may  be  possible  despite  uncertainties  about  the  goodness 
of  individual  parameter  estimates. 

The  purpose  of  this  paper  is  to  better  identify  when  prediction  is  likely  to  be  accurate  with 
multicollinear  data.  Specifically,  this  paper  was  stimulated  by  three  problems  noticed  by  Owen 
and  Reynolds  (18)  in  their  development  of  a prediction  equation  for  estimating  engineering 
manhours  for  proposed  aircraft  programs: 

1.  they  decided  to  include  no  more  than  12  predictor  variables  from  a total  of  about  60 

possible  ones  since  only  23  observations  on  the  response  variable  (engineering 
man-hours)  were  available;  * 

2.  a backward  elimination  [2,  Chapter  6)  procedure  was  performed  to  reduce  further 
the  number  of  predictor  variables  to  eliminate  "the  possibility  of  the  accidental  dele- 
tion of  a significant  variable  due  to  its  interacton  with  other  variables";  and 

3.  the  authors  concluded  that  "some  limits  of  extrapolation  for  formulas  should  be  a 
primary  objective  of  future  studies." 

We  will  address  each  of  these  problems  in  subsequent  sections,  not  with  the  goal  of  providing 
final,  definitive  solutions  to  them,  but  rather  to  show  how  each  afl'ects  the  estimation  of  the  t 

regression  parameters  and  the  use  of  the  resulting  prediction  equation.  We  do  not  intend  to  • 

argue  that  any  particular  estimator  is  the  best  one  to  use  with  multicollinear  data.  We  will, 
however,  point  out  some  advantages  of  using  a principal  component  estimator  to  obtain  a pred- 
iction equation. 


2.  LEAST-SQUARES  PREDICTION  EQUATIONS 

In  this  section  we  will  examine  problems  2 and  3 of  Owen  and  Reynolds,  which  were 
listed  in  the  previous  section.  Suppose  the  assumed  linear  regression  model  is  written  as 

(1)  \ - fij  + xp  +1, 


where  Y is  an  x I)  vector  of  observations  on  the  response  (dependent)  variable,  1 is  an 
in  X 1)  vector  of  ones,  T » IX|,  Xj X^l  is  an  in  x p)J'ull  column  rank  matrix  of  predic- 

tor (independent)  variables,  Ai  is  an  unknown  constant,  /3  is  a (pxl)  vector  of  unknown 
regression  parameters,  and  « is  an  (n  x 1)  vector  of  unobservable  random  error  terms  with  t — 
A((0,  tT^  1).  For  simplicity,  we  assume  that  the  elements  of  X,  are  standardized  so  that  X,  1 — 0 

and  X,  X,  — 1 7 — 1,2 P.  Finally,  model  (1)  is  assumed  to  represent  adequately  the 

response  variable  although  some  of  the  predictor  variables  may  not  be  needed  for  adequate 
prediction 

The  three  problems  cited  by  Owen  and  Reynolds  result  from  inadequacies  in  the  data 
used  to  estimate  the  model  parameters,  in  particular,  from  multicollinearitics  in  the  data.  A 
multicollinearity  can  be  defined  as  a linear  combination  of  the  columns  of  X that  is  nearly  zero. 
This  implies  that  T'A'is  nearly  singular.  A multicollinearity  is  not  necessarily  due  to  some  vari- 
( ables  being  redundant  in  the  specification  of  the  model,  but  they  may  be  redundant  for  the  data 

collected. 
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Redundant  model  variables,  those  variables  that  will  be  redundant  for  all  samples  of  data, 
can  and  should  be  deleted  from  the  model  since  they  serve  only  to  inflate  the  variance  of 
predicted  responses  (see,  e.g.,  [8]).  If  the  redundancy  is  inherent  only  in  the  particular  data 
sampled,  it  is  dangerous  to  remove  them  from  the  predictor  since  the  estimated  model  may 
then  be  biased  when  future  responses  are  predicted.  Yet  multicollinearities  tend  to  cause  the 
deletion  of  one  or  more  of  the  multicollinear  variables  merely  because  they  are  involved  in 
multicollinearities,  not  because  they  are  worthless  predictor  variables. 

To  see  this  latter  point,  denote  the  eigenvalues  of  X'X  by  /|  ^ (2  ^ . . . > and  the 

corresponding  eigenvectors  by  V,,  Vj V„.  If  there  are  one  or  more  multicollinearities 

among  the  columns  of  X,  one  or  more  of  the  eigenvalues  of  X'X  will  be  nearly  zero.  For 
eigenvalues  that  are  near  zero,  multicollinearities  can  be  identified  by  noting  that 

<2>  = 0-  t i/„x,  = o. 


Equation  (2)  shows  that  the  eigenvector  V,  corresponding  to  a small  eigenvalue  /,  provides  the 
coefficients  for  the  linear  combination  of  the  columns  of  X causing  a multicollinearity.  Natur- 
ally, the  larger  elements  in  V,  identify  the  predictor  variables  most  strongly  multicollinear. 
Reference  (16)  contains  a more  complete  discussion  of  multicollinearities  and  the  problems 
associated  with  them. 

The  least-squares  estimator  of  p for  the  model  specified  by  (1)  is 

(X'X)-'X’\. 

The  variances  and  covariances  of  the  p,  can  be  found  from 

(3)  Varlll  = i/r'V,V,o-^ 

/-I 

From  (3)  we  can  see  that  small  eigenvalues  in  X' X will  result  in  large  variances  and  covariances 
for  estimated  parameters  of  variables  involved  in  multicollinearities  (those  with  large  F values 
in  (2)). 

When  attempting  to  reduce  the  number  of  variables  in  the  prediction  equation,  the  t 
statistic  commonly  used  to  test  H„:  /3,  = 0 is 

(4)  t =/§/((•„  MSE)''^ 

where  is  the  j th  diagonal  element  of  (X'X)~^  and  MSE  is  the  estimate  of  <t^  computed  from 
the  full  model  (I).  Since  the  values  of  variables  involved  in  multicollinearities  tend  to  be 
large  due  to  the  small  /,  in  (3),  the  r statistics  corresponding  to  these  variables  tend  to  be  small. 
This  accounts  for  the  tendency  for  variables  to  be  deleted  by  some  computer  programs  because 
of  their  "interaction  with  other  variables". 

Contrary  to  Owen  and  Reynolds’  supposition,  backward  elimination  also  suffers  from  this 
problem.  Backward  elimination  deletes  the  variable  with  the  smallest  t statistic  at  each  stage. 
Since  multicollinear  variables  tend  to  have  small  / statistics,  at  least  one  multicollinear  predictor 
variable  is  likely  to  be  deleted  from  the  model.  See  (6)  for  an  illustration  of  this  property. 
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The  problem  o(  eliminating  important  predictor  variables,  problem  2 of  the  previous  sec- 
tion, is  thus  dir^’tly  related  to  multicollinearities  in  the  data.  Multicollinearities  in  the  data 
used  to  estimate  fi  may  affect  prediction  even  if  all  the  predictor  variables  are  used  in  the  pred- 
iction equation.  Write  the  least  squares  prediction  equation  as 

(5)  y u'J 

where  /So  = F and  u is  a vector  of  values  of  the  p-predictor  values  which  are  standardized  as  in 
(1). 


Since  Y generally  estimates  /Sq  well,  (5)  will  be  an  adequate  predictor  of  the  response  if 


u'^=  u'/3 


for  all  values  of  u in  some  region  of  interest.  Now  u'  /3  is  an  unbiased  estimator  of  u'  /S,  with 
variance 

Var[u'^]  =“  (T^  u'(A"A')"'u 


(6)  ^ ir^  ^ I,  ' (u'\ y. 

/-I 


It  can  be  seen  from  (6)  that  Varlu'  p]  will  be  unacceptably  large  for  many  points  u if  one  or 
more  of  the  /,  are  sufficiently  small,  or  some  of  the  u'V,  are  large. 

A commonly  known  but  infrequently  used  means  of  estimating  the  precision  of  prediction 
is  to  form  a 100  (1—  a)  % prediction  interval  for  Kat  the  point  u; 

(7)  F - (a/2)  • s < y < F + I, Aa/2)  ■ s, 

where  i^(a/2)  is  the  upper  100  (a/2)  % point  of  the  / distribution  with  v = n—p-\  degrees  of 
freedom,  and  s = l(l+n  '-Eu' (T'T)  'u)  • MSE)'^^  The  width  of  this  prediclion  interval 

depends  on  ^ ' (u’ V,)^  as  in  (6). 

/-I 


Both  (6)  and  (7)  essentially  depend  on  how  small  u'V,  is  relative  to  If  /,  is  extremely 
small,  the  u'V,  must  also  be  small  or  prediction  will  be  poor.  These  considerations  suggest  the 
definition  of  a "region  of  predictability"  wherein  prediction  would  be  expected  to  be  suitably 
accurate.  One  such  region  can  be  defined  as 

(8)  /?  - (u;  (u'V,(  < c,,  ,/-l p,  and  a,  < u,  < 6,1, 

Where  a,  and  b,  are  the  minimum  and  maximum  standardized  values  of  the  ith  predictor  vari- 
able, i.e.,  a,  - minU^,;  A-l.  1 n)  and  b,  - maxfT*,; /t  = l,2 n)  for  /-1, 2 p. 


Two  methods  for  choosing  the  c,  are 
(i)  c,  — or 
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(ii)  I'l  = max{|w,V,|.  / = 1,2 n\  where  w,  is  the  ith  row  of  X.  Method  (•)  insures 

that  /,  ' (u'V,)^  ^ 1,  while  (ii)  bounds  u' by  the  largest  of  the  values  for  the  points  w,  used 
to  estimate  (i.  Each  of  these  methods  can  be  interpreted  as  requiring  that  the  prediction  equa- 
tion be  used  only  in  regions  for  which  data  has  been  collected,  i.e.,  the  requirements  (i)  and 
(ii)  limit  extrapolation.  If  one  wishes  to  predict  outside  R,  the  predicted  values  must  be  cau- 
tiously used,  but  this  does  indicate  a partial  response  to  problem  3 of  Owen  and  Reynolds. 

By  far  the  worst  prediction  will  occur  for  points  which  have  values  of  jn'V  ,1  that  are  large 
for  small  values  of  /,.  Suppose  r multicollinearities  have  been  detected  by  a careful  examina- 
tion of  the  /,  and  V,  and  possibly  by  other  procedures  such  as  investigating  the  "correlation" 
matrix  X'X  or  the  variance  inflation  factors  [14,151.  The  f,,  a,,  and  b,  could  then  be  relaxed  in 

(8)  for  the  first  (p—r)  directions  u'V,,  ,/  = l,2 p—r.  In  these  directions  extrapolation  could 

be  allowed  with  the  understanding  that  (7)  should  still  provide  acceptable  bounds.  These  ideas 
will  become  even  more  important  with  the  discussion  of  problem  1 in  Section  3. 

Note  that  R is  based  solely  on  sample  information,  information  available  to  the  data 
analyst  at  the  time  he  wishes  to  make  a prediction.  If  (8)  is  not  satisfied,  predicton  may  — and 
sometimes  will  — be  accurate  since  (5)  is  an  unbiased  estimator  of  Pq+  u'0.  Prediction  for 
u € /?  provides  the  assurance  that  the  prediction  equation  is  suitably  precise. 

If  variable-selection  procedures  are  used  to  reduce  the  number  of  variables  in  the  model, 
prediction  will  be  adequate  provided  (2)  holds  for  the  points  at  which  prediction  is  desired. 
This  implies  that  u'V,  = 0 for  these  new  points.  But  this  restriction  is  again  in  the  form  of  a 

region  (8)  with  the  c,  chosen  suitably  small  for ./  = p-r  + \ ,p—r+2 p.  Thus  if  a region  of 

predictability  of  the  form  (8)  is  constructed,  least-squares  estimation  and  variable-selection 
techniques  will  tend  to  yield  prediction  equations  which  are  accurate  despite  the  multicollineari- 
ties in  the  data  used  to  estimate  the  parameters.  Outside  this  region  the  predictor  cannot  be 
expected  to  perform  well  due  to  large  variances  of  the  predictor  or  bias  due  to  erroneously 
deleting  important  predictor  variables. 

3.  A PRINCIPAL-COMPONENT  PREDICTOR  FOR  UNDERSIZED  SAMPLES 

Owen  and  Reynold’s  first  problem,  having  to  use  only  about  12  of  60  possible  predictor 
variables  in  their  initial  models,  results  from  fewer  observations  than  predictor  variables  being 
available  for  the  analysis.  The  full  rank  analysis  of  (1)  using  least  squares  requires  that  n > p, 
a requirement  not  satisfied  by  their  data. 

There  is  a wide  range  of  model-building  problems  that  could  be  addressed  at  this  point 
concerning  specification  of  model  (I),  but  it  is  not  within  the  scope  of  this  paper  to  do  so.  We 
merely  wish  to  raise  the  obvious  questions  regarding  the  deletion  of  many  potentially  valuable 
predictor  variables  subjectively,  on  the  basis  of  a partial  analysis  of  the  response  and  a subset  of 
the  predictor  variables,  or  by  using  a stepwise  procedure  such  as  forward  selection  (see  [13)  for 
some  objections  to  this  technique  lor  full  rank  models).  One  acceptable  means  of  deleting  vari- 
ables prior  to  an  analysis  of  the  complete  (assumed  correct)  model  (I)  occurs  if  there  are 
model  redundancies,  as  was  mentioned  in  Section  2. 

Rather  than  demanding  a full  rank  analysis,  generalized  inverse  estimators  offer  another 
option.  The  generalized  inverse  solution  is  generaly  presented  in  a discussion  of  singular  X 
matrices  (as  in  designed  experhnents)  for  which  ii>p  (see,  e g.,  119),  [20],  or  (22).  While  the 
existence  of  this  estimator  of  fi  and  its  estimability  characteristics  are  well-known,  its  potential 
use  with  undrsized  samples  (n^p)  has  not  been  fully  explored.  An  exception  to  this  statement 
is  in  the  ecrmomic  literature  of  simultaneous-equations  systems  [3,11,211. 
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The  particular  generalized  inverse  estimator  we  will  examiiic  in  this  section  is  referred  to 
in  the  literature  (e.g.  |I7,141)  as  a principal  components  estimator.  If  n>/rand  T has  rank  p-r 
(corresponding  to  i = . u = • • ■ = 'n  ‘^e  principal  component  estimator  is 

defined  to  he 

,9)  (X'X)-x‘y  = y,Li:'yi,x'\ 

where  the  generafized  inverse  of  X'X  is  (A"A')~=  yiLi  ^yj  f';  = IVi,V2 ,1,  and 

L,  = diag  (/,,/2 1,,  ,).  It  is  often  demonstrated  that  (9)  is  the  least-squares  estimator  of 

subject  to  the  constraints  k'()/3=  0,  where  ^0=  lV^-,+i,  Vp.,+2 

With  undersized  samples  (i.e.,  n^p)  there  are  usually  .«  very  small  eigenvalues  of  T"A' in 
addition  to  the  r zero  ones.  We  propose,  therefore,  a generalization  of  (9)  for  undersized  sam- 
ples which  is  of  the  same  form  but  wjth  f'/  = (V),  V2,  ...  , V,,.,].  Again,  it  can  ^ shown  that 
this  is  the  least  squares  estimator  of  subject  to  the  constraints  y,  P = 0,  where 

f , = (V^I  , n.  y p ■,  r ¥2'  ■ • • n 

Our  rationale  for  using  this  estimator  of  /3  stems  from  a different  justification  than  the 
parameter  constraints  given  above.  This  justification  stresses  the  use  of  the  actual  information 
provided  by  the  matrix  of  predictor  variables  and,  as  we  shall  see,  again  yields  guidelines  for 
the  use  of  the  resulting  predictor. 

Let  //  be  an  iiixii)  matrix  of  eigenvectors  of  XX\  partitioned  as  //  = [//, : //,:  //,)).  ///. 
is  II  x{p-s-r)  and  contains  the  eigenvectors  of  XX'  corresponding  to  the  eigenvalues  in 

/.,  = diag  (/| ,),  //,  is  (/ix.s)  and  contains  the  eigenvectors  corresponding  to  the 

eigenvalues  in  L,  = diag  rn and  /V,,  toninins  the  eigenvectors  corresponding 

to  the  n-p-t-r  /.era  eigenvalues.  For  undersized  samples,  r,  the  number  of  zero  eigenvectors  of 
X'X,  is  generally  equal  to  /;-(/;-!),  so  that  Hn  contains  only  one  vector.  Then  we  can  write 
141 

(10)  A'  = //„4>  y„  -F  y,  + y,. 


= Af,)  -I-  A,  -I-  Af,., 

where  is  an  (.n—p  + r)x^r  matrix  of  zeros  and  Af()=  etc.  Since  A^o  “ ^ AT,  •!> 

(since  L,  contans  small  eigenvalues),  we  see  from  (10)  that  X~  X/.  This  emphasizes  the 
point  that  the  entire  space  of  predictor  variables  has  not  been  sampled,  only  a subspace  that  is 
primarily  spanned  by  the  eigenvectors  in  Inserting  Xf  in  place  of  Tin  (1)  and  obtaining 
the  principal  component  solution  to  the  normal  equations  yields  (9)  with  defined  as  above. 
This  argument  can  also  be  used  to  justify  the  use  of  a principal  component  estimator  for  the 
full  rank  model  if  multicollinearities  are  present  since,  then,  X “ X,  -¥  X j ~ X/  . 

The  principal  component  prediction  equation  for  undersized  samples. 


is  bia.sed.  The  bias  of  (1 1)  can  be  written 


(12) 


B(Y)  - u'p-u'  (X,X,)-X,,Xl 

- u'fi-u'y,y,.P. 
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and  the  variance  of  u'^  is 

Varlu'^1  = u'(A',  A',  )'u<r^ 

(13)  ' (u'V,)'. 

I 

I 

The  variance  term  (13)  does  not  suffer  from  having  small  eigenvalues  as  does  (6),  but  (12) 
indicates  that  the  predictor  is  generally  biased.  Note  that  if  k'ou  = 0 and  =0. 
u'^=  u'f",  f'/  and  (II)  indeed  turns  out  to  be  unbiased.  This  again  reflects  the  f.tci  that 
prediction  should  he  accurate  if  we  restrict  the  region  of  predictability  to  points  in  a general 
region  that  was  actually  sampled. 

This  discussion  suggests  a region  similar  to  (8)  within  which  prediction  could  be  pro- 
posed, but  outside  of  which  prediction  should  not  be  recommended.  Extrapolation  can  also  be 
allowed  in  the  space  spanned  by  I',.  An  illustration  of  these  recommendations  is  the  subjeci  of 

the  next  section.  ! 

4.  AN  ASSESSMENT  OE  H ■ 

I 

In  this  section  an  example  is  presented  to  illustrate  the  potential  benefits  of  using  a region  i 

such  as  R as  a guide  m predicting  Again,  a prediction  interval  of  the  form  (7)  is  preferable  to  ^ 

R when  .V  is  o(  lull  coluitn  rank  and  a sufficient  number  of  observations  are  available  to  obtain  | 

a good  estimate  of  irl  Otherwise,  R can  still  be  effectively  used,  as  is  now  demonstrated. 

The  example  concerns  the  ten-variable  data  of  Gorman  and  Toman  (5].  A detailed  ■ 

analysis  of  this  data,  including  a listing  of  the  raw  data,  is  given  by  Daniel  and  Wood  ID.  Two  i 

analyses  of  this  data  are  to  be  performed  here;  a full-rank  analysis  in  which  15  of  the  n=36  data  | 

points  are  used  to  obtain  a predictor  (points  1,  3,5,  ...  , 27,  and  29)  and  an  undersized-sample  ' 

analysis  in  which  only  the  first  10  of  the  36  data  points  are  used.  Each  predictor  is  then  used  to 
predict  the  remaining  observations 

With  the  full-rank  analysis,  the  smallest  latent  root  of  the  standardized  X'X  matrix  is 
/io  - 0.0050,  with  corresponding  latent  vector 

V'|„-  (0.448,  0.099,  0.046,  0.098,  -0.398 

0.752,  0.072,  -0.183,  -0.104,  0.064)  , 

r 

From  the  discussion  of  Section  2,  both  (7)  and  (8)  suggest  that  prediction  should  not  be  l 

attempted  unless  u V,,,  is  small.  (For  simplicity  and  ease  of  discussion,  we  are  only  considering  [ 

one  small  latent  root  in  this  analysis.  Since  (9  = 0.063,  we  may  wish  to  consider  the  magnitude  1 

of  u'V9as  well).  Usng  least  squares,  a predictor  of  the  form  (5)  was  constructed.  j< 

Figure  1 is  a plot  of  | Y—Y,  vs.  |u'V|o|  for  the  36-15  = 21  data  points  not  used  to  esti- 
mate the  parameters  in  the  prediction  equation.  The  trend  is  clear;  the  magnitude  of  the  resi- 
duals increases  with  the  magnitude  of  u'V  iq.  While  some  moderate-sized  residuals  do  occur 

with  large  magnitudes  of  u'V  iq.  there  are  no  large  residuals  for  small  magnitudes  of  u'V|o.  * ; 

Also  evident  from  Figure  1 is  the  need  to  explore  possible  bounds  on  u'V,,^  The  two 
suggested  in  Section  2 turn  out  to  be 

I 

(i)  |u'V|„|  < /,'^2  - 0.071 


; . 
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For  the  data  plotted  in  Figure  1,  both  bounds  identify  only  points  for  which  prediction  is  rela-  t 

tively  good.  j 

For  the  undersized  sample,  /lo  “ 0 and  /,  = 0.0104.  The  corresponding  latent  vectors  are  [. 


and 

V',-  (0.221, 

-0.645, 

0.046, 

-0.092, 

-0.131, 

0.148, 

0.064, 

0.219, 

0.562 

-0.342) 

• 

V',„=  (0.661, 

0.495, 

-0.085, 

0.290, 

-0.3b7, 

0.188, 

-0.059, 

0.160, 

0.088 

-0.139) 

The  latent  vectors  corresponding  to  the  eight  remaining  latent  roots  were  used  to  estimate  P as 
in  (9)  and  then  form  the  prediction  equation  in  (11).  Figure  2 is  a plot  of  the  residuals 
I y - f-j  of  the  remaining  36-10  - 26  data  points  (with  circles  indicating  j Y,-  K,1  < 0.75,  tri- 
angles indicating  0.75  < | Y,-  f,l  < 1.50,  and  squares  indicating  1.50  < 1 K,-  Y,\)  as  a func- 
tion of  |u'V,o|.  Again  the  trend  is  clear,  smaller  residuals  occur  predominantly  with  smaller 
values  of  both  |u'V9|  and  |u'V|o|. 
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Figure  2.  Residuals  of  Gorman-Toman  data  based 
on  undersized  sample  analysis. 


5.  SUMMARY 

The  intent  of  this  paper  is  to  focus  attention  on  an  aspect  of  regression  analysis  that  is 
often  overlooked  when  the  resulting  prediction  equation  is  employed.  Regardless  of  the  sample 
size  used  to  obtain  estimates  of  model  parameters  (an  particularly  when  the  sample  size  is 
small),  estimation  is  highly  inaccurate  outside  a region  generally  defined  by  (8).  Yet  regions  of 
this  form  are  always  available  to  the  data  analyst  and  can  be  very  valuable  as  guides  in  predict- 
ing. The  Gorman-Toman  data  illustrates  that  in  both  the  full-rank  situation  and  the 
undersized-sample  case  a region  R formed  by  considering  u'V^  for  latent  vectors  V,  correspond- 
ing to  zero  or  small  latent  roots  of  X'X  was  effective  in  identifying  when  prediction  was  likely  to 
be  inadequate.  Further  work  in  this  area  should  concentrate  on  refining  R\  in  particular, 
developing  reasonable  bounds,  c„  for  (8)  based  on  the  information  in  X. 
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General  solutions  and  specific  solution  methods  for  systems  of  simultaneous  linear 
difference  equations  with  constant  coefficients  have  been  provided  by  Pfouts  (3,4].  Consider  a 
reduced  system  of  simultaneous  equations  with  constant  coefficients 

(1)  jf  (r)  — flj  x(/ — 1)  + — + B„x(i  — m), 


where  x{i)  — |x|(r) x„  (r)]'  and  the  B,’s  are  n x n matrices.  Pfouts  has  shown  that  (1) 

can  be  solved  for  any  variable  as 


(2)  Xfil)  — f|  x,(r  — 1)  + . . . + c,  Xi(t  — r),  r < mn,  j — I,  2.  ....  n, 

where  the  c,’s  do  not  depend  on  Define  an  r x r matrix  C derived  from  the  coefficients  of 

the  right  hand  side  of  (2)  as 


C|  C'2  • • • C, 

1 0 •••  p 

0 1 0 


0 0 ■ • • 1 0 


Pfouts’  Theorem  2 and  Theorem  3 together  show  that  Urn  x,  (f)  - 0 for  all  Jif  and  only  if  the 

l-»oa 

moduli  of  characteristic  roots  of  Care  all  less  than  unity. 

It  is  clear  that  we  have  to  calculate  the  c,’s  in  order  to  analyze  the  stability  of  (1)  by 
applying  Pfouts’  method.  It  is  not  easy  to  calculate  the  c,’s  in  general,  especially  when  n and/or 
m are  large.  This  difficulty  can  be  avoided  by  taking  into  account  the  norms  of  block  elements 
in  a partitioned  matrix.  To  see  this,  first  introduce  changes  of  variables  defined  as 

Z|(/)  - x(/), 


I 


1 


>' 


Z}(l)  - Zyit  - I). 


z„{t)  “ Z„.|(/  - 1)  - x(/  - /n  + 1). 

Ill 


Then  (1)  can  be  rewritten  as 


K OKUGUCHI 

1 

•* 

f 

z,(f) 

S. 

flj- 

B,,, 

Z|(f-1) 

Z2U) 

/ 

0 . . . 

0 

Zjlf-l) 

Z}(l) 

= 

0 

/ 

0 

Zjit  - 1) 

zji) 

0 

0 • ■ • / 

0 

Zjt  - 1)  ' 1 

This  is  the  first-order  system  of  simultaneous  difference  equations  with  constant  coefficients. 
The  coefficient  matrix  of  (3)  is  the  same  as  B in  Pfouts  [4).  Let  the  coefficient  matrix  of  (3)  be 
denoted  by  B. 

Let  A =\Aij]{l.  7 = 1,  2,  , AO  be  a partitioned  matrix  whose  (/,  J)  block  is  A/j,  and 

let  7'4  be  a matrix  whose  elements  consist  of  norms  of  block  elements  of  A,  that  is, 

= (ll/4/ylll  (/,  7 = 1,  2 A/ ),  where  II  II  is  an  arbitrary  matrix  norm  such  that  ll/ll  = 1. 

It  has  been  proven  by  Okuguchi  [2]  (see  also  Conlisk  (ll)  that  the  moduli  of  characteristic 

N 

roots  of  A are  all  less  than  /x  if  is  indecomposable  and  ^ IM/^II  ^ ix,  1 = 1,  2 N, 

j-\ 

with  strict  inequality  for  at  least  one  /.  Applying  this  fact  to  B,  the  coefficient  matrix  of  (3),  we 
can  immediately  assert  that  moduli  of  characteristic  roots  of  B are  all  less  than  unity  [hence 
lim  Z,(f)  “ 0 for  all  / = 1,  2 m,  hence  lim  x,(/)  = 0 by  virtue  of  the  definition 

t—oo  1—00 

Zi(f)  = x(f),  7 “ 1,  2 n]  provided  that  llfi,ll  0,  for  / = 1,  2,  ....  m,  and 


£llfl,ll  < 1. 

1-1 


Clearly  the  stability  condition  (4)  is  easier  to  apply  than  Pfouts'  which  is  based  on  calculation  of 
characteristic  roots  of  C.  However,  condition  (4)  is  only  sufficient  for  the  stability  of  (1). 
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ABSTRACT 

This  paper  develops  an  algorithm  for  a "shortest  route"  network  problem  in 
which  it  is  desired  to  find  the  path  which  yields  the  shortest  expected  distance 
through  the  network  It  is  assumed  that  if  a particular  arc  is  chosen,  then  there 
is  a finite  probability  that  an  adjacent  arc  will  be  traversed  instead.  Backward 
induction  is  used  and  appropriate  recursion  formulae  are  developed.  A numeri- 
cal example  is  provided. 

I.  INTRODUCTION 


Consider  a network  with  nodes  numbered  1 through  N and  an  arc  ij  connecting  pairs  of 
nodes  i and  /.  Associated  with  each  arc  is  a nonnegative  distance  c„.  Assume  that  the  network 
has  no  loops;  i.e.,  once  a particular  node  has  been  left  it  is  impossible  to  return  to  it.  In  a con- 
ventional network  problem,  the  objective  is  to  find  a shortest  path  between  node  1 and  node  N. 
Various  solutions  to  this  problem  were  initially  presented  by  Bellman  [1],  Dantzig  (2),  and 
Dijkstra  (3),  while  Dreyfus  [4]  has  provided  a review  of  some  of  the  more  commonly  used 
techniques. 


Since  that  time  various  extensions  and  modifications  have  been  made  to  the  problem 
itself.  For  example,  there  may  be  some  cost  involved  for  each  distance  unit  travelled  on  cer- 
Uin  arcs  (the  objective  here  being  to  minimize  total  cost).  More  recently.  Frieze  (5]  has  exam- 
ined the  case  in  which  path  length  is  defined  as  a real-valued  function  defined  on  the  paths. 
The  version  considered  in  this  paper  is  the  case  in  which,  once  a particular  arc  is  chosen  there 
is  a finite  probability  that  an  adjacent  arc  will  be  traversed  instead,  i.e.,  you  can’t  always  be  cer- 
tain of  travelling  along  the  arc  you  desire.  The  aim  is  to  develop  an  algorithm  that  will  deter- 
mine the  optimal  choice  of  next  arc  at  each  node  so  as  to  minimize  the  total  expected  distance 
travelled  from  node  1 to  node  N. 

2.  FORMULATION  OF  THE  ALGORITHM 

Suppose  that  at  node  / (/  — 1,  ....  A — 1)  there  are  n,  possible  arcs  which  may  be 

traversed.  Label  their  nodes  . /2 J„ . Let  be  the  probability  of  traversing  the  chosen 

arc  //■  (I  < r < n,).  Assume  that  if  the  chosen  arc  is  not  travelled,  then  there  is  an  equal 
chance  that  one  of  the  other  feasible  arcs  will  be  traversed  instead.  For  each  of  these  arcs,  this 
probability  is  (1  - /»„  )/(«,  - 1)  for  w,  > 1;  if  n,  - 1,  then  />„-!. 
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Let  g(i)  be  the  shortest  expected  distance  from  node  / to  node  N (<  - 1,  ...  /V  - 1),  and 
define  g(\)  ~ 0.  Suppose  that  an  arc  iJr  is  actually  traversed  from  a node  ;.  Then  the  shortest 
expected  distance  from  node  / to  node  N becomes 

A/,  “ + «0,). 

Suppose  that  it  is  decided  to  traverse  arc  ij,.  Then  the  shortest  expected  distance  to  node  N is 
(for  / - I yv  - 1) 

(o-/,  + + + A,,„+-  + 

if  n,  > 1, 


gU)  - p„^  D,f  + 


• - P>,, 
n,  - 1 


“ A/,  if  n,  = }. 

Thus,  from  node  / a succceeding  node  7 is  to  be  found  which  minimizes  g(i)  over  all  possible  y. 
That  is  for  / ” 1 - 1 


(1)  g(i)  ” min 

Jr 


if  n,  > 1, 


PMjJ,  + SUr)]  + 


1 - P. 


"r  I 


n,  - 1 


L (c,.  + «(7/,)l 


A--J 


- c,/,  + g(7|) 


if  rt,  =»  1 . 


The  equations  for  gfi)  may  be  solved  by  backward  recursion,  i.e.,  start  with  the  fact  that 
g{N)  is  zero,  then  find  g(N  — 1)  , g(N  — 2),  etc.,  and  finally  ^(1).  An  illustration  of  the 
technique  is  given  in  the  next  section. 

3.  NUMERICAL  EXAMPLE 


Consider  a six-node  network  in  which  it  is  desired  to  travel  from  node  1 to  node  6 cover- 
ing the  shortest  distance.  The  values  of  the  c„  and  p,,  are  given  for  each  arc  /7in  Table  1. 


TABLE  1 


Arc  ij 

<=,t 

P,j 

1-2 

2 

0.70 

1-3 

3 

0.80 

2-3 

2 

0.40 

2-4 

3 

0.60 

2-5 

4 

0.60 

3-4 

3 

0.60 

3-5 

3 

0.80 

4-5 

2 

0.70 

4-6 

4 

0.60 

5-6 

1 

1.00 
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The  network  is  displayed  diagrammatically  in  Figure  I,  with  the  arc  lengths  shown  on 
each  arc  together  with  the  probability  (in  parentheses)  of  traversing  that  arc. 


From  Table  1 it  can  be  seen  that  /ii  = 2,  ni^  1,  ^3  — 2,  n^  — 2,  /I5  — 1.  The  value  of 
g(i)  is  found  for  each  node,  beginning  at  node  6. 

NODE  6:  By  definition,  jf(6)  — 0. 

NODE  5:  Clearly,  ^(5)  — 1,  as  there  is  only  one  possible  path  to  node  6. 

NODE  4:  Let  7,  — 5,  73  “ 6.  From  (1) 

2 

2f(4)  - min  (7)4,^  (04,^  + ;?(7;)1  + (\  - P^,)  (C4,,  + 

' ' k-\ 

- min(3.30,  3.60) 

- 3.30,  which  occurs  for  j^. 

Thus,  at  node  4 the  next  node  to  aim  for  is  S,  with  an  expected  total  distance  of  3.30  to  node  6. 
NODE  3;  Let  7,-4.  73  ” 5.  From  (1), 

2 

^(3)  - min  {pj,^  (cj,^  + ^(7,)]  + (1  - p,,  ) ^ [cj,^  + ^(7*))} 

'r  ' *-l 

/•-1, 2 k^r 

- min(5.38,  4.46) 

“ 4.46,  which  occurs  for  Ji- 

Thus,  at  node  3 the  next  node  to  aim  for  is  S,  with  an  expected  distance  of  4.46  to  node  6. 

NODE  2:  Let  7|  — 3,  73  — 4,  73  — 5.  From  (1), 

(1  - P2;)  3 

g(7)  - min  + g(7,)l  + — ^ ^ 1^2;*  + «0*)11 

''  ^ k-\ 

r-t.2.}  k*r 

- min(5.97,  6.07,  5.55) 

- 5.55,  which  occurs  for  jy 


r 
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Thus,  at  node  2 the  next  node  to  aim  for  is  node  5,  with  an  expected  total  distance  of  5 55  to 
node  6. 


NODE  1:  Let  /,  - 2.  yj  - 3 From  (U, 

Xd)  - mm  lp„^  Ic,,^  + g(j,)]  + (1  - + ^(y^)ll 

' I — I 


- min(7  53.  7 48) 

7 48.  which  occurs  for  /j 

Hence,  at  the  source  of  the  network  (node  1)  the  hrsi  node  to  aim  for  is  3 The  total  expected 
distance  through  the  network  is  7 48 

4.  REMARKS 

A dynamic  programming  approach  has  been  used  to  solve  a stochastic  shortesi-route  prob- 
lem The  technique  used  is  readily  adaptable  to  suluiinn  by  computer  It  should  be  noted  thai 
the  recursion  formulae  in  (1)  reduce  to  those  for  the  conventional  shortesi-route  problem  if  p, 
“ I for  each  pair  of  nodes  / and  /,. 

There  are  a number  of  applications  of  ihe  siiKbasiic  model,  particularly  in  the  held  of 
traffic  flow,  where  it  cannot  be  certain  that  the  chosen  path  will  be  the  one  actually  taken  This 
can  apply  to  everyday  situations  such  as  driving  (where  accidents,  road  bhKks  traffic  conges- 
tion, etc  may  block  a desired  route)  or  air  travel  (where  striker  'erouting  or  bad  weather  may 
cause  a change  of  path)  Other  applications  are  to  recreational  activities  such  as  bushwalking 
(bushfires,  floods,  landslides),  hang-gliding  (wind  variations)  and  long  distance  swimming 
(current  changes)  and  the  business  activities  such  as  investment  over  a fixed  lime  hori/on 
where  interest  rates  and  investment  opportunities  are  subyeii  to  change 
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ABSTRACT 

After  first  formulating  the  problem  of  the  Marine  Environmental  Protection 
program  of  the  Coast  Guard  as  a multiple-objective  linear  program,  we  investi- 
gate the  applicability  and  limitations  of  goal  programming.  We  point  out  how 
the  preemptive  goal-programming  approach  is  incompatible  with  utility  prefer- 
ences. Then  we  observe  the  tendency  of  optimal  solutions  for  standard  linear 
goal  programs  to  occur  at  extreme  points.  We  also  note  problems  of  more  gen- 
eral approaches,  such  as  dealing  with  additively  separable  approximations  to 
preferences 

INTRODUCTION 

In  this  paper  we  evaluate  the  applicability  and  limitations  of  goal  programming  with 
respect  to  the  Marine  Environmental  Protection  program  of  the  Copst  Guard  In  the  first  sec- 
tion we  review  the  Coast  Guard  problem  and  formulate  it  as  a multipit-objective  mathematical 
programming  problem.  Then,  we  focus  on  the  need  for  a prior  specification  of  preferences  over 
the  activities  of  the  MEP  program  in  order  to  guide  field  units  and  to  evaluate  the  impact  of 
resource  constraints.  We  observe  that  an  attempt  to  characterize  preferences  as  priorities  and 
then  to  solve  the  corresponding  preemptive  goal-programming  problem  yields  solutions  in 
which  priorities  are  changed  and  where  threshold  constraints  are  introduced.  In  section  2 we 
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poinl  out  (evidenily  Cor  ihe  first  lime)  how  the  preemptive  approach  is  incompatible  with 
preferences  representable  as  a utility  function  and  that,  in  particular,  changes  in  priorities  can 
be  rationalized  as  a futile  attempt  to  stale  preferences  in  a priority  fashion.  When  considering 
the  use  of  the  standard  linear  goal-programming  approach,  we  note  the  tendency  ol  optimal 
solutions  to  occur  at  extreme  points.  We  also  note  problems  of  more  general  approaches,  such 
as  dealing  with  additively  separable  approximations  to  preferences. 


1.  THE  PROBLEM  AND  THE  ATTEMPTED  SOLUTION 

The  U.S.  Federal  Water  Pollution  Control  Act  (FWPCA)  gave  the  U.S.  Coast  Guard 
(USCG)  responsibility  and  enforcement  authority  for  the  protection  of  the  marine  environment 
from  discharges  of  oil  and  other  iiazardous  materials.  The  Marine  Environmental  Protection 
(MEP)  program  was  established  by  Ihe  Coast  Guard  in  1971  to  manage  these  new  duties. 

Unfortunately  there  were  no  data  relating  controllable  activities  such  as  harbor  patrols  to 
any  output  measures  such  as  the  amount  of  oil  spilled.  Although  a pollution-incident-reporting 
system  was  augmented  to  include  data  of  this  type,  the  problem  of  determining  whether  an 
activity  prevents  or  detects  a discharge  is  still  not  resolved.  For  instance,  would  the  number  of 
I spills  detected  increase  or  decrease  with  more  harbor  patrols?  Thus,  the  locus  of  the  Coast 

Guard  became  one  of  identifying  those  activities  that  were  felt  to  contribute  to  the  accomplish- 
ment of  the  objectives  legislated. 

As  a result  12  activities,  ranging  from  monitoring  the  transfer  operations  ol  tankships  and 
barges  to  pursuing  a public  education  program,  were  selected  as  surrogates  for  the  "real"  objec- 
i tives  of  the  Coast  Guard.  Thus,  with  respect  to  these  surrogates,  the  Coast  Ciuard  viewed  the 

problem  as  one  of  having  12  objectives  corresponding  to  these  activities.  The  problem  then 
became  one  of  trying  to  determine  the  most  effective  activity  levels  subject  to  resource  con- 
' slraints  on  personnel,  vehicles,  and  boats  I8l. 

! Inasmuch  as  the  most  effective  activity  levels  could  not  he  determined  by  objective 

methods,  the  Coast  Guard  subjectively  estimated  levels  based  on  the  experience  of  USCG 
Headquarters  and  field  personnel  and  on  resource-utilization  survey  data.  These  subjectively 
estimated  levels  were  promulgated  as  Mission  Performance  Standards  in  January  1973.  Two  of 
> them,  Standard  1 and  Standard  4,  are; 

' SI.  Monitoring  25%  of  all  liquid  bulk  transfer  operations  involving  oil  and  hazardous 

substances,  ...  ; 

p S4.  Conduct  daily  patrols  of  the  essential  harbor  area  by  water  once  during  daylight 

hours. 

. These  standards  became  targets  for  field  units  in  performing  their  daily  activities.  Unfor- 

tunately, the  field  units  could  not  feasibly  achieve  the  standards  in  toto  and,  in  addition,  had  no 
guidance  in  making  trade-offs  among  performance  levels  of  the  activities.  As  a result,  it  was 
impossible  for  MEP  program  management  to  estimate  how  activity  levels  would  change  as  a 
, function  of  resources  supplied  This  posed  a difficult  situation  when  the  Department  of  Tran- 

sportation and  the  Office  of  Management  and  Budget  reviewers,  who  are  faced  by  tight 
budgetary  constraints,  tried  to  determine  the  marginal  utility  of  any  additional  resources.  This 
J paper  discusses,  in  particular,  how  goal  programming  was  used  to  attempt  to  solve  these  prob- 

lems. 
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However,  before  we  discuss  the  goal  programming  approach  we  give  a multiple  objective 
linear  programming  formulation  of  a field  unit’s  problem: 

(1)  MaxUi,  Xj,  xj.  X4,  X5,  Xh,  X7.  Xk,  X,,  X|„,  x,,,  Xi^) 

subject  to 
12 

^ a,  ,x,  ^ h,  for  / = 1 , 2,3. 

/-) 

0 ^ X,  ^ /X,  for  7 =>  1,  . , . , 12. 

where 

X|  = the  number  of  liquid  bulk  transfer  operations  involving  oil  and  hazardous  sub- 
stances monitored  per  period; 

xj  = the  number  per  period  of  tankships  boraded; 

xj  = the  number  per  period  of  barges  boarded; 

X4  = the  number  per  period  of  daily  daylight  water  patrols  of  essential  harbor  areas; 

x,  = the  number  per  period  of  daily  night  water  patrols  of  essential  harbor  areas; 

Xfi  = the  number  per  period  of  waterfront  facilities  spotchecked  each  month; 

X7  = the  number  per  period  of  waterfront  facilities  inspected  every  six  months; 

Xji  = the  number  per  period  of  waterfront  facilities  to  be  surveyed  once  every  two 
years; 

X(,  = the  number  per  period  of  monitors  sent  to  the  scene  of  oil  and  hazardous  sub- 
stance discharges  to  determine  adequate  removal; 

X|o  = the  number  of  polluting  discharges  per  period  where  Coast  Guard  is  responsible 
for  providing  on-scene  coordinator  ...  to  which  such  a coordinator  is  sent; 

X||  = the  number  of  reported  polluting  discharges  per  period  to  which  a Coast  Guard 
representative  is  sent; 

X|2  = the  number  of  man-hours  per  period  pursuing  a public  education  program; 

and  where  m,  are  obvious  upper  bounds  for  these  variables  (e.g.,  /i|  = the  number  of  transfer 
operations  per  period),  is  the  number  of  boat  hours  per  period,  ftj  is  the  number  of  vehicle 
hours  per  period,  by  is  the  number  of  personnel  hours  per  period,  and  a,,  is  the  number  of 
hours  of  iesource  / used  per  unit  of  activity  7 for  /“  1,2,3  and  7 = 1 12. 

In  an  attempt  to  solve  some  of  the  problems  of  guidance  to  field  units  and  to  answer 
some  of  the  questions  of  marginal  utility  of  additional  resources,  it  was  decided  to  study  the 
setting  of  priorities  lor  the  standards.  This  was  accomplished  via  preemptive  goal  programming. 

Preemptive  goal  programming  requires  the  specification  of  p goals  and  the  partitioning  of 

these  goals  into  K mutually  exclusive  subsets  P*  of  (1 />!,  where  « eP*  , /3€/\  , and  A ' < 

A "if  goal  rt  is  so  important  that  it  must  be  attained  before  any  resources  can  be  allocated  for 
the  (explicit)  purpose  of  attaining  goal  fi.  Each  subset  is  called  a priority  class,  and  within  each 
priority  cla.s.s  trade-offs  are  made  via  nonpreemptive  (for  example,  weighted  linear)  goal  pro- 
gramming.* 


•Sec  Kornhlulli  (9|  and  ( harncs  and  ( noper  |4|  Ibr  n survey  final  programming 
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In  the  MEP  problem  the  standards  were  chosen  as  goals,  «n  that  p = 12,  and  the  initial 
priorities  made  by  the  Coast  Guard  were  P\  “ (11,  P2  = (2,3),  P3  = (4,5),  P^  = (9, 10),  P^  = 

(11),  Pf,  = (6,7,81,  and  Pi  = (12},  so  that  K = 1.  Given  such  a priority  structure,  the  deter- 
mination of  activity  levels  was  simply  viewed  as  solving  the  following  preemptive  one-sided 
goal  programming  problem  for  A = 1 K: 

(2. A)  Min  ^ H', 8, 

12 

subject  to  5^  22,  ,v,  < h,  fori  = 1,2, 3, 

/-I 

+S,  -8/  = .v,,  / = ! 12, 

Z ^1  = -.i  for  ’7=1-  ■k-\. 

0 < v,  < for  /=l 12,  j 

8,  > 0 and  s;  > 0 for  2 = 1 12.  1 

where  j 

H',  = a nonnegative  number  representing  the  relative  weight  of  deviations  from  goal  / 
in  priority  class  P^  for  each  A; 

= the  optimal  objective  function  value  to  problem  (2.tj); 

v,  = the  goal  (Mission  Performance  Standard)  for  activity  ,/  (which  must  be  less  than 

or  equal  to  the  structural  upper  bound  /j.,);  | 

8,  = the  amount  that  x,  is  below  the  standard  s,;  j 

8/  = the  amount  that  x,  is  above  the  standard  s,;  ' 

As  noted  by  others  (e.g.,  Charnes  and  Cooper  (31)  the  solution  procedure  for  this  prob- 
lem is  a A'-stage  process  completely  analogous  to  the  two-phase  simplex-method  approach.  f 

Further,  by  introducing  non-Archimedian  weights  Af*  for  each  priority  class  P;,  where  • 

Ml,  » Ml,  n,  the  problem  can  be  viewed  as  a linear  program.  f 

I 

Solving  the  preemptive  goal-programming  problem  (2.k)  for  the  initial  priority  classes,  , 

with  the  relative  weights  w,  set  equal  to  unity,  we  obtained  a solution.  However,  MEP  program  r 

management  did  not  feel  that  this  solution  represented  the  best  utilization  of  resources  and,  as  ‘ 

a result,  made  two  changes;  priority  rankings  were  modified,  and  threshold  levels  for  specific  » j- 

activities  which  had  zero  activity  levels  were  established.  These  threshold  constraints  entered  j 

the  problem  (2.k)  as  lower  bounds  /,  on  the  activities,  i.e., 

(3)  /,  ^ X,  for /-1 12. 

The  preemptive  goal-programming  problem  (2.k)  as  modified  by  the  threshold  constraints 
(3)  was  solved  with  respect  to  the  revised  priority  structure.  However,  the  new  solution  was 
not  acceptable  either  and  a second  reordering  of  priorities  was  given.  Again,  however,  the 
resulting  solution  did  not  appear  to  make  the  best  utilization  of  the  resources.  This  led  us  to 
evaluate  the  annroach. 
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2.  LIMITATIONS  OF  GOAL  PROGRAMMING 

Although,  at  first,  it  may  seem  that  priority  changes  reflect  improvements  in  the  under- 
standing by  MEP  program  administrators  of  their  preferences,  a closer  analysis  reveals  a basic 
characteristic  of  the  preemptive  goal-programming  approach.  To  understand  this  characteristic, 
consider  the  MEP  problem  and  choose  two  standards  in  different  priority  classes.  Then, 
observe  that  the  level  of  performance  of  the  two  corresponding  activities  can  be  characterized 
by  a lexicographical  preference  ordering  between  the  activities  (within  the  limits  of  the  stan- 
dards). Now  from  the  argument  given  in  note  2 on  pages  72  and  73  of  Debreu  (61,  which 
shows  that  lexicographical  preferences  cannot  correspond  to  a utility  function,  we  conclude  that 
having  a uiiliiy  function  over  the  twelve  activities  is  inconsistent  with  the  preemptive  goal-propramminp 
approach  to  the  MEF  problem.  (Note  that  the  utility  preferences,  in  this  case,  would  be  those  of 
the  particular  MEP  program  administrator  responsible  for  the  activities  and,  in  particular,  his 
utility  function  would  be  his  perception  of  a welfare  function*  over  the  activities  where,  of 
course,  his  perception  is  the  result  of  staff  reports  and  staff  opinions.)  With  re.spect  to  this 
result,  the  changing  of  priorities  of  the  standards  to  get  a solution  that  "represented  the  best 
utilization  of  resources"  can  be  viewed  as  a futile  attempt  to  model  utility  preferences  in 
preemptive  fashion. 


More  generally  (from  analogous  reasoning)  we  have  the  following  result;  for  a multiple- 
objective  program  with  m objectives,  having  a utility  function  over  the  m objectives  is  theoretically 
inconsistent  with  the  preemptive  goal-programming  approach  of  specifying  a goal  for  each  of  the  m 
objectives  and  partioning  the  goals  into  different  priority  classesf.  Note  that  specifying  a goal  for 
each  of  the  m objectives  and  the  partitioning  of  the  goals  into  different  priority  classes  is  the 
typical  approach  in  applying  preemptive  goal  programming.  However,  this  does  not  mean  that 
the  preemptive  approach  is,  in  general,  incompatible  with  a utility-preference  structure. 

Again  using  the  MEP  problem  as  an  illustration,  we  now  show  how  the  preemptive 
approach  can  be  made  consistent  with  a utility-preference  structure.  More  specifically,  letting 
V(.x)  be  a utility  function,  letting  ,x*be  an  optimal  solution  to  the  problem 

(4)  Max  U(x) 

.Subject  to  (the  same  constraints  as  in  (I)), 

and  letting  c be  a positive  number,  we  will  see  how  a preemptive  approach  could  theoretically 

yield  a solution  t,  where  0 < UU']  - Uix)  < *.  To  see  this,  let  (c, t).l  be  a set  of  p 

positive  numbers,  where  c,  < cj  < . . . c„,  where  c„*,  - <•„  < € for  />  = I p - 1,  and 

where  ^ U(\‘).  For  each  - I p we  define  a goal,  called  goal  /;,  as  finding  a fea.sible 

solution  x where 


15. h) 

C(.v)  ^ c„ 

or,  equivalently. 

15  ,/;)  ' 

i € |v:  U(x)  ^ c), 

Note  that  this  cf)rresponcls  to  having  a solution  on  or  above  the  indifference  (isoutility)  contour 
(v:(/(v)  «■  I.et  Pf,  (//)  for  each  /;  * I p (so  that  A'  * p).  Now,  the  preemptive 

Of  foufsc.  iiH  shown  by  Arrow  |)).  a Iruc  wel/iire  funcOon  iJoes  no(  generally  cxisl.  We  are.  however,  only  consider- 
ing a pcrccplion  of  one 

♦ This  result  (Iocs  not  seem  to  be  generally  known  — at  least,  no  statement  of  it  was  found  in  the  literature 
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approach  proceeds  by  finding  feasible  solutions  that  satisfy  (5.h)  for  increasingly  larger  values 
of  />,  and  this  yields  solutions  with  higher  and  higher  utilities.  In  particular,  it  yields  a feasible 
solution  X where,  for  some  h q,  ^ U(x)  < c), >1  and  q,  < U(x*)  ^ t>,  *i  and,  hence, 
U(x*)  ~ U(x)  < q,  H - q,  < «• 


Since,  in  practice,  we  won’t  know  the  functional  form  of  U(x),  we  might  try  to  approxi- 
mate the  set  (S.h)'.  If  x is  a feasible  solution  where  U{x)  = c,„  then  one  approximation  to 
(5.h)’  is  the  set  |x;x^x).  Note  that  by  identifying  x,  with  the  threshold  r,,  we  see  that  the 
Coast  Guard,  by  introducing  the  threshold  constraints  (3),  was  actually  modifying  the  original 
preemptive  goal-programming  model  in  this  way.  Since  changing  the  priorities  of  the  standards 
alone  cannot  yield  a good  approximation  to  a utility  function,  we  can  view  this  as  an  attempt  to 
obtain  solutions  more  consistent  with  a utility-preference  structure. 


One  possible  way  to  overcome  the  difficulties  of  the  preemptive  approach  is  simply  to 
abandon  it  in  favor  of  a nonpreemplive  goal-programming  approach.  One  tempting  possibility 
is  to  simply  let  A"  = 1 in  (2.k)  so  that  we  gel  a weighted  linear  one-sided  goal  program  (2.1). 
From  an  observation  by  Oyer  [7],  this  corresponds  to  assuming  that 


12 

U(x)  ~ £ 
where  ' 


U,(x,) 


L/,(s,)  + w^(xy— s,)  when  x,  ^ s,, 
t/,(s,)  when  x,  ^ s,. 


One  problem  in  using  this  approach  is  the  well-known  difficulty  in  specifying  a priori  the 
goals  (s,)  and  the  deviational  weights  (w,).  Another  problem,  not  generally  recognized,  is  the 
tendency  for  optimal  solutions  to  standard  linear  goal  programs  to  occur  at  extreme  points.  To 
illustrate  this  difficulty  suppose  that  standards  were  chosen  such  that  s,  > x,  for  all  x satisfying 
the  constraints  to  (1).  Then,  the  objective  function  in  (2.1)  can  be  written  as  a linear  function 
in  X so  that  a solution  to  (2.1)  will  almost  always  (i.e.,  with  probability  one)  be  at  a unique 
extreme  point  to  the  constraints  in  (1).  On  the  other  hand,  one  would  not  usually  expect  the 
activity  levels  maximizing  a utility  function  to  be  at  an  extreme  point  (especially  if  the  extreme 
points  are  not  "close"  to  each  other).  The  above  difficulty  is  symptomatic  of  a basic  problem  of 
standard  linear  goal  programming  resulting  from  using  a simple  weighted  deviation  from  goals 
as  an  approximation  to  a preference  function. 

This  leads  to  the  desirability  of  obtaining  a better  approximation  to  the  utility  function. 
One  approach  (motivated  by  this  application)  is  goal-interval  programming  (GfP)  as  developed 
(for  this  problem)  by  Charnes,  Cooper,  flarrald,  Karwan,  and  Wallace^lS).  Like  standard  linear 

goal  programming,  this  approach  implicitly  assumes  that  C(x)  = ^ iJ,(x,).  However,  the 

/-I 

GIP  approach  assumes  that  each  U,(x,)  is  a general  piecewise  linear  function  and,  in  addition, 
when  each  C^(x, ) is  concave  it  shows  how  the  goal  program  reduces  to  a linear  program.  In 

implementing  this  approach  we  must  ask  how  restrictive  the  assumption  that  U(x)  ~ ^ 

(-1 

t/,(x,)  is.  In  particular,  when  can  we  a.ssume  that  Clx)  is  additively  separable  and  how  limited 
is  this  approach  when  Uix)  is  not  additively  separable?*  In  the  MLP  application,  for  example, 

•Suhscqucnl  u.  the  submission  of  ihis  piipcr.  frolcssor  J S Dyer  milepenillly  prescnieil  a paper  COn  the  Relationship 
Between  (ioal  I’rogrammmg  anil  Multiattribiile  Ulililv  theory.-  presenleil  at  the  International  Symposium  on  Extremal 
Methods  and  Systems  Analysis.  Austin.  Texas,  September  ISl?)  that  diseusses  these  nuestions  ol  adilitive  separability 
in  goid  progriimmin^ 
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certain  activities  (such  as  activities  6,7,  and  8)  are  probably  intrinsically  interrelated  so  that  the 
additive  separability  must  be  questioned.  This  points  the  way  to  investigating  other  nonlinear 
functions  (including  metric  based  utility  (loss)  functions  discussed  in  |31)  to  obtain  better 
approximations  to  preferences.  In  doing  .so,  however,  the  problem  of  relating  such  functions  to 
actual  preference  data  (for  example,  to  estimate  the  relevant  parameters)  must  be  addressed. 
This  means  that  an  integration  of  methods  for  quantifying  preference  data  (e.g.,  the  multidi- 
mensional scaling  methods  of  Carroll  [2!  or  Srinivasan  and  Shocker  llOl)  into  the  goal  pro- 
gramming approach  should  be  made. 
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